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EFFECT OF PSYCHOLOGICAL FEEDBACK 
UPON WORK DECREMENT 


R. B. PAYNE AND G. T. HAUTY 
USAF School of Aviation Medicine 


A recent review of the literature by 
Ammons leaves little doubt that some 
form of psychological feedback is 
beneficial to the learning and per- 
formance of a wide variety of tasks 
(1). In fact, this principle has been 
so thoroughly confirmed in its elemen- 
tary form that it has become a 
fundamental consideration in the de- 
sign of training programs and equip- 
ment (4), and its incorporation by 
general behavior theory poses a major 
challenge. Nevertheless, one could 
scarcely say that all the fundamental 
questions about psychological feed- 
back have been resolved. Optimal 
understanding and use of the principle 
in the control of behavior await 
further work upon such problems as 
the content, latency, and specificity of 
the return; the immediacy and endur- 
ance of its effects; and the interaction 
of such effects with proficiency level, 
work duration, cerebral efficiency, and 
many other conditions of task per- 
formance upon which the return may 
be imposed. 

There is a notable poverty of 
evidence concerning the effects of 
feedback upon prolonged performance. 
Does a supplementary return promote 
proficiency indefinitely? Does it ar- 
rest or forestall work decrement in 
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prolonged repetitive tasks? Opinions 
on these questions are divided. Bart- 
lett, for example, has said that the 
most efficient offset to “skill fatigue”’ 
so far demonstrated is some means 
whereby S can periodically check his 
performance against a standard (2). 
He argues that without such knowl- 
edge S may unknowingly lower his 
standard so that his output, although 
deteriorating, may nevertheless seem 
acceptable. On the other hand, 
Hauty and Payne found that Ss ap- 
prised of substandard performance by 
supplementary auditory or visual cues 
were unable to forestall work decre- 
ment (3). The signals boosted pro- 
ficiency for the entire period of 7 hr., 
but they did not affect the rate of 
decline in any favorable way. 

In a work decrement situation it is 
possible, of course, that the efficacy of 
feedback will depend upon its rele- 
vance to whatever factors are re- 
sponsible for the deficit. If the 
decrement is fundamentally a matter 
of motivational failure, a return 
having predominantly incentive prop- 
erties should allay the onset of decline. 
On the other hand, if the decrement 
is largely a matter of organizational 
impairment, a return having pre- 
dominantly directive properties should 
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provide an effective countermeasure. 
In actuality, work decrement almost 
certainly involves both these factors, 
as well as a great many others, and 
this raises the interesting possibility 
that its arrest might depend upon the 
additivity of different returns applied 
concurrently. Such notions. cannot 
be easily tested, for they involve two 
principal uncertainties—the  etio- 
logical details of work decrement, and 
the stimulus functions of the returns. 
However, it is at least possible to 
arrange the form and content of re- 
turns so that an operational distinc- 
tion between cue properties and drive 
properties can be made on logical and 
intuitive grounds. For example, a 
very prompt and specific designation 
of the fact and locus of error might be 
said to promote proficiency largely 
because of its directive properties, 
even though the cues might acquire 
some drive value. By contrast, a 
tardy, infrequent signal which merely 
apprises S of substandard perform- 
ance without suggesting some specific 
corrective action, if beneficial to per- 
formance, would seem so_ largely 
because of its incentive properties. 
Evidence of significant, orthogonal 
effects attributable to the two types 
of signals would favor the distinction 
drawn between them on rational 
grounds. 

The foregoing considerations led 
to the design of an experiment which 
explored the single and joint effects of 
directive and incitive techniques upon 
work decrement in a complicated com- 
pensatory pursuit task. These effects 
were examined across different levels 
of cerebral efficiency induced by 
analeptic and depressant drugs. 


PROCEDURE 


Subjects —The Ss were 144 volunteer, ex- 
perimentally naive, healthy, rested, basic airmen 
whose ages ranged narrowly around 20 yr. 


Task.—The USAF SAM Multidimensional 


Pursuit Test (CM813E), described in previous 
communications (3, 5), was used. Briefly, this 
task involves the scanning of four cam-driven 
simulated aircraft instruments and the counter- 
action of their drifts by the coordinated adjust- 
ment of simulated aircraft controls. The per- 
formance schedule involved a_ preliminary 
practice period consisting of 40 1-min. trials, a 
10-min. rest period during which experimental 
factors were introduced, and a final work period 
consisting of 192 1-min. trials (4 hr.). Score, 
consisting of the time during which all four 
pointers were kept on null concurrently, was 
recorded for each of the five pretreatment cycles 
and each of the 24 posttreatment cycles (1 cycle 
= § trials). 

Experimental conditions.—Directive feedback, 
which will be called Tuition (T), consisted of 
three methods of apprising S whenever any of 
the four instrument pointers drifted from its 
null area. The control level (T;) consisted of no 
prescribed warning at all, i.e., the deviations 
were detectable only through Ss’ own scanning 
procedure. The second level (T2) provided, in 
addition to Ti, a single peripheral visual signal 
whenever a deviation occurred, but it did not 
specify the deviant instrument. The third level 
(T;) provided, in addition to Ti, individual 
instrument signals that specified both the fact 
and the locus of deviation. The supplementary 
signals were triggered instantaneously by the 
deviations. 

Incitive feedback, which will be called Moti- 
vation (M), provided Ss with varying amounts 
of information concerning their progress at the 
conclusion of each cycle of eight trials. The 
control level (M:) consisted of no prescribed in- 
formation at all other than a very general, hazy 
notion of progress which S might form spontane- 
ously. The second level (M2) provided S with 
knowledge of his standing relative to a perform- 
ance level represented to him as a group norm. 
This information was gauged to the nearest o/4, 
and it pertained only to the single cycle then 
completed. The third level (M3) included the 
second, but it also allowed S’s standings on 
previous cycles to remain in view. The relative 
standings were displayed by differently colored 
pegs inserted into a wall board drilled with 24 
(cycles) X 24 (¢/4) holes. The absolute value 
of the group standard on each cycle was set at 
lo above the cycle mean which had been cal- 
culated from normative data, although S was 
never aware of this because the relative standings 
were always displayed in standard score units 
taken by E from a conversion table. Thus, the 
group standard represented to Ss was actually 
that which would be expected to be equaled or 
surpassed by about 16% of the parent pop- 
ulation. 

Cerebral efficiency was varied by means of 
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TABLE 1 


ANALYSIS OF VARIANCE OF ADJUSTED 
PosTTREATMENT SCORES 




















Source df — F 
Tuition 2 | 5,071,246| 78.96* 
Motivation 2 669,336} 10.42* 
Drugs 3 | 4,032,156] 62.78* 
TXM 4 116,021 1.81 
ie 8 6 125,073 1.95 
MxD 6 83,790 1.30 
TXMxXD 12 94,805 1.48 
Ss treated alike 108 64,226 
Cycles 23 226,621 | 116.81* 
Cx T 46 2,153 1.11 
CxXM 46 3,522 1.81* 
CxD 69 44,140} 22.75* 
Cx TxD 138 3,525 1.82* 
CxXMxD 138 1,568 | <1.00 
Cx Tx 92 2,273 1.17 
ee ee & ai 276 1,901 | <1.00 
Pooled interaction 

Ss X Cycles 2,484 1,940 
Total 3,455 














* Significant ratios, p < .005. 


clinical doses of analeptic and depressant drugs 
(D). The lowest level (D,) was induced by a 
mixture of .65 mg. hyoscine hydrobromide with 
50 mg. diphenhydramine hydrochloride (Bena- 
dryl-hyoscine). Control levels were achieved by 
no drug at all (D,) and lactose placebo (D2). 
The highest level (D3) was induced by a 5 mg. 
dose of dextro-amphetamine sulfate (Dexedrine). 
All preparations were delivered in No. 1 pink 


“capsules during the interval between preliminary 


and final practice periods. 

The organization of the experiment followed 
a multiple-factor design consisting of 3 X 3 K 4 
= 36 treatment combinations. The serial order 
of the combinations was randomized across 36 
testing days. On any one day four Ss were 
drawn randomly from the population of volun- 
teers, assigned to the combination selected for 
that day, and tested simultaneously in different 
rooms under essentially identical conditions. 

At the conclusion of the final work period, 
Ss completed a questionnaire designed to gain 
information about their task attitudes and other 
dispositions, as reported in a previous investi- 
gation (5). 


ReEsuLts AND Discussion 


The 3,456 scores (144 Ss X 24 
cycles) were adjusted for regression 
upon pretreatment achievement lev- 


els. Their variation was then ana- 
lyzed as shown in Table 1, where 
Tuition, Motivation, and Drugs con- 
stitute the main experimental vari- 
ables, and Cycles are treated as 
replications. 

Directive feedback (Tuition).—It is 
apparent from Table 1 that the 
variation associated with directive 
feedback is highly significant. A plot 
of the general means for each treat- 
ment, shown in the upper panel of 
Fig. 1, and the F ratio for the C X T 
interaction indicate that the treat- 
ment effects are essentially uniform 
across all cycles. Tests of signi- 
ficance' applied to treatment differ- 
ences show that T; performance is 
superior to T, performance across all 
cycles (p < .01), and that T, per- 
formance is superior to T, perform- 
ance across all cycles (p < .01). 

Despite the apparent generality of 
the directive feedback effect, a further 
study of Table 1 indicates that the 
treatment differences are not uniform 
from one drug group to another 
throughout all cycles (CX TX D 
interaction). This state of affairs is 
elaborated in Fig. 2, which plots the 
cycle means for each T X D com- 
bination. When analyzed by ap- 
propriate tests of significance the 
C X T X D interaction is found due 
largely to the progressive depreciation 
of feedback effects under the Benadryl- 
hyoscine condition (p 2 .08 for 23 of 
the 24 T X D, subclass mean com- 
parisons on the final 8 cycles). Toa 
smaller degree the interaction is due 
also to the fact that the T; effects are 
not significantly greater than the T; 
effects under the Dexedrine condition 
beyond the Cycle 16 (p 2 .11 for the 
eight subclass mean comparisons). 
The generality of the effect holds, 
however, for all pharmacological con- 
ditions over the first 16 cycles, and 


1 All ¢ tests cited in this study were double 
tailed. 
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for both the pharmacologically neutral 
conditions throughout all 24 cycles. 


The foregoing evidence shows con- 
clusively that the undrugged S’s pro- 
ficiency is raised uniformly throughout 
the entire 4 hr. of work by instantaneous 
warnings of performance error. Since 
the increment is a positive function of 
the specificity of the information sup- 


Effects of principal treatments upon work decrement. 


plied, it can be ascribed largely to the 
directive properties of the cues, i.e., the 
cues impose a more efficient temporal and 
spatial organization upon S’s scanning 
behavior. Curiously enough, however, 
the cues failed to offset performance 
decrement relative to the initial gains 
which they produced. When S’s cere- 


bral efficiency is altered by analeptic or 
depressant drugs, the stimulus functions 
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of the cues undergo a change in the 
terminal stages of performance. In 
cerebral analepsis the locus-specifying 
function of the cues provides no sig- 
nificant advantage over that which 
merely indexes the fact of error, a result 
which is not surprising if analepsis per 
se allows S to work at or near maximum 
capability. In cerebral depression both 
types of directive returns gradually lose 
their regulatory functions, perhaps be- 
cause of raised excitation thresholds 
induced by the anticholinergic properties 
of the Benadryl-hyoscine mixture and by 
the prolongation of work. 


Incitive feedback (Motivation).— 
Reference to Table 1 again shows that 
the variation attributable to incitive 
feedback treatments is very signifi- 
cant. These effects are orthogonal to 
those arising from the other two ex- 
perimental variables, but the C K M 
interaction suggests that they are not 
uniform throughout the entire post- 
treatment period. The plot of 
general treatment means shown in the 
central panel of Fig. 1 will aid in the 
visualization of this relationship. An 
appraisal of treatment differences for 
each cycle shows that M; means 
are significantly superior to My, 
means from the second cycle onward 
(p S .04). Noeffect, of course, could 
have been exerted by these treatments 
during the first 10 min. because they 
were applied only at the conclusion of 
cycles. The Ms means are signifi- 
cantly superior to the M,; means for 
Cycles 2-14 (p = .05), but thereafter 
the differences are generally not 
significant. The M3; means are gener- 
ally not significantly superior to the 
Mz means. 

Since this type of feedback provided 
neither specific nor timely cues to 
what S should do in order to maintain 
his proficiency, its effect could scarcely 
be attributed to directive properties. 
Rather, it would seem more reason- 
ably endowed with drive value.arising, 


in the case of Mo, from the observed 
relationship to the group standard, 
and, in the case of M3, from this 
source plus the observed relationship 
to previous levels. The latter source, 
of course, was a _ negligible spur 
relative to the former during the first 
14 cycles, but thereafter it became 
essential to the success of the return. 


Certain similarities and differences in 
the behavior of directive and incitive 
returns are now readily apparent. Or- 
thogonal to one another, both have im- 
mediate positive effects upon perform- 
ance. Relative to these initial gains, 
however, they fail to arrest work decre- 
ment for any appreciable length of time, 
either singly or in combination. They 
are distinguishable in terms of the way 
in which they behave in the presence of 
other variables in the experiment. For 
example, the effects of directive returns 
are profoundly modified by extremes of 
cerebral efficiency, while those arising 
from incitive returns are not. Also, the 
effects of incitive returns are diminished 
by task duration, while those arising 
from directive returns are not. The 
orthogonality and distinguishability of 
the two types of return constitute 
empirical grounds for the different stim- 
ulus functions which were imputed to 
them earlier. 


Drugs.—Since the differential ef- 
fects of the cortical stimulant and 
depressant used in this study have 
been demonstrated repeatedly, there 
is little point to any prolonged con- 
sideration of them here (3, 5). The 
F ratio in Table 1 shows Drugs to be a 
significant source of variation, the 
uniformity of which varies with cycles 
(C X D interaction). The general 
treatment means plotted across cycles 
are shown in the lower panel of Fig. 1. 
During the first 30 min. of work there 
are no significant treatment differ- 
ences. Thereafter the depressant 
curve drops sharply and remains sig- 
nificantly under all others (p < .01), 
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while the analeptic curve becomes and 
remains significantly superior to all 
others (p < .01). The C X D inter- 
action, therefore, reflects largely the 
fact that the performance effects of 
drugs must await the completion of 
internal absorption and distribution 
processes. 
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action has already been elaborated in 
terms of directive feedback treat- 
ments within drugs across the cycles, 
there is some merit to a further 
examination of it in terms of the 
differences between drugs. For this 
purpose the D means within the T 
subclasses have been plotted across 
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propriate tests of significance applied 
to D differences under the T, condi- 
tion indicate that, after the lapse of 
absorption-distribution time (approxi- 
mately 40 min.), the Ds; means are 
significantly superior to all others 
and the D, means are significantly 
inferior to all others (p S .01). Under 
the T; condition all means are shifted 
upward during the early cycles, but 
the D, means decline more rapidly 
than before, reaching the level ex- 
hibited under T; in about 2 hr. By 
contrast, the de'ine of the pharmaco- 
logically neutrai means is somewhat 
delayed, in consequence of which the 
D; superiority is neither as prominent 
nor as prolonged as under the T, con- 
dition. The D3; means are signifi- 
cantly higher than D, means in only 
10 of the 24 cycles (p S .05), and 
they are significantly higher than D» 
means in only 18 of the 24 cycles 
(p = .05). Essentially comparable 
outcomes are found under the T; 
condition. Evidently, therefore, the 
extremes of cerebral efficiency induced 
by the pharmacological preparations 
placed certain limitations upon the 
increments in proficiency which could 
be realized from directive feedback. 
They seem to have defined a range of 
capacity within which proficiency 
variation due to other influences could 
occur. Incitive feedback effects did 
not interact with drug effects across 
the cycles probably only because they 
were relatively less profound than 
directive feedback effects, i.e., the 
increment induced by them fell well 
within this range of capacity imposed 
by the drugs. 

Cycles —Changes in performance 
from cycle to cycle are generally quite 
significant, and they depend in a very 
complicated wavy upon the various 
experimental treatments and their 
combinations. The trend of the gen- 
eral means, of course, is downward; 
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but this trend is affected markedly by 
drugs, and to a smaller but still sig- 
nificant extent by incitive feedback 
and by D XT combinations. For 
example, consider the drug influences. 
Under the depressant condition there 
is a profound rate of decline during 
the first 120 min.; and the more the 
performance is raised initially by 
directive feedback, the greater the 
rate of decline (see bottom panel of 
Fig. 2). By contrast, the analeptic 
condition induces an increment over 
the first 90 min., and the only sig- 
nificant declines observed later are 
relative to the peak of this increment. 
Under none of the D; X T combina- 
tions was the output of any later cycle 
significantly lower than that displayed 
during the first 20 min. of posttreat- 
ment work. Such declines as were 
exhibited under the analeptic condi- 
tion, therefore, were a positive func- 
tion of the singular and joint incre- 
ments induced by the drug and the 
directive feedback treatments. It 
follows from this and previous evi- 
dence that the burden of performance 
sustentation was borne by the ana- 
leptic treatment. The C X M inter- 
action is due mainly to the fact that 
incitive treatments induce an initial 
delay in performance decline. 
Subjective effects and performance.— 
Evidence was presented in an earlier 
paper to show that the performance 
effects induced by cortical analeptics 
and depressants could not be ac- 
counted for in terms of concomitant 
subjective disposition effects (5). 
Since this finding was contrary to 
what many have believed about the 
role of attitudes and dispositions in 
performance, there seemed good rea- 
son to seek verification of it in the 
present study. Accordingly, task dis- 
positions and attitudes were sampled 
by a 13-item questionnaire, similar to 
the one used earlier, at the conclusion 
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of the work period. Each S’s re- 
sponse to each item was scored on a 
3-point scale; and a composite score, 
which could range from 13 (un- 
favorable) to 39 (favorable), was 
derived for each S. The variation in 
terminal performance levels was ad- 
justed for multiple regression upon 
pretreatment levels and disposition 
indexes, then analyzed in order to 
determine the significance of the main 
experimental effects and their inter- 
actions. Suffice it to say that our 
previous findings with respect to (a) 
the effects of drugs upon subjective 
dispositions, and (6) the functional 
connection between these and per- 
formance effects, were confirmed. 


SUMMARY 


One hundred forty-four Ss received prelinti- 
nary training on a complicated compensatory 
pursuit task involving simulated aircraft instru- 
ments and controls, then continued work for 4 
hr. after having been distributed among 36 
combinations of three directive feedback treat- 
ments, three incitive feedback treatments, and 
four pharmacological treatments. The treat- 
ment effects were appraised in terms of subjective 
dispositions and task performance, and the 
functional connection between these outcomes 
was examined. 

Directive feedback signals induced an im- 
mediate increment in task proficiency. The 
increment was a positive function of the speci- 
ficity of the information supplied, and it re- 
mained essentially constant for the duration of 
the task when all other experimental effects were 
pooled. However, it was not maintained at 
former levels during the terminal stages of 
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performance under analeptic and depressant 
conditions. There was no condition, nor com- 
bination of conditions, under which directive 
feedback could be said to have postponed work 
decrement. 

Incitive returns also induced an immediate 
increment in task proficiency. This increment 
was orthogonal to all other experimental effects; 
but it varied with task duration, first, by growing 
progressively larger for approximately the first 
60 min., then by growing progressively smaller 
throughout the remainder of the task. Thus, 
incitive returns could be said to have postponed 
work decrement for no longer than an hr., either 
singly or in combination with other experimental 
variables. 

Dextro-amphetamine sulfate was singularly 
effective in minimizing work decrement, while 
the mixture of hyoscine and diphenhydramine 
hydrochloride hastened and maximized it. 
Significantly divergent task dispositions induced 
by these preparations were shown to be unable to 
account for corresponding divergences in per- 
formance. 
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COMPARISONS AND RANK ORDER 
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In 1930, Hevner (4) published a 
study in which scale values arrived at 
for 20 different samples of handwriting 
evaulated by 370 Ss were compared 
for the method of paired comparisons, 
rank order, and equal appearing 
intervals. She concluded, in part, 
that scale values from the methods of 
paired comparisons and rank order 
were linearly related and practically 
equal. 

In order to avoid differences due to 
the statistical treatment of her data, 
Hevner resorted to the following pro- 
cedure: 


The rank order for each of the specimens for 
each subject was first recorded and especially 
prepared data sheets were then used to tabulate 
for each subject the number of times each speci- 
men was preferred to every other specimen. 
The results of this method, in the final form, 
are, therefore, comparable to the results of the 
method of paired comparisons, and the treatment 
of the results was the same for both methods 
(4, p. 196). 


By comparing the scale values ob- 
tained from these derived data with 
those obtained directly from the 
method of paired comparisons, Hev- 
ner was able to demonstrate the near 
equivalence of the two sets of scale 
values. The application of this find- 
ing to later work leads to some 
interesting relationships. 

In 1928, Guilford (1) had suggested 
a short-cut method for finding scale 
values for the method of paired com- 
parisons. This method he later called 
the Composite-Standard Method and 
applied it to both paired comparisons 
and rank-order data (2). This pro- 
cedure deals with the total number of 
choices for any one item (C) in the 
method of paired comparisons and 


with the sum of the ranks (>> fR) for 
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each item in the method of rank 
order (3). 

He arrived at proportions from the 
method of paired comparisons by use 
of the formula 


C+ .5N 
_ nN ’ (1) 


where n is the number of items and 
N the number of judges, and showed 
that the scale values arrived at by the 
use of these proportions are linearly 
related to those obtained from the use 
of more elaborate procedures (3, p. 
170). 

Similarly, by summing the rank 
values (> fR) for each item (3, p. 
179), he arrived at a set of proportions 
which are treated in the same manner 
as those from paired comparisons: 


_ CFR -.5N 
Pi>cs = nN ’ (2) 


M, 


where n and N have the same mean- 
ings as above and R = 1 is the lowest 
rank. (Therefore, if r= 1 is the 
highest rank, R = n — r+ 1.) 

Now since the scale values arrived 
at by the use of Equations 1 and 2 
involve identical statistical proce- 
dures, and since Hevner has shown 
that the scale values arrived at by the 
two different methods are the same, 
it follows that the proportions ob- 
tained from Equations 1 and 2 must 
be the same if the same N judges 
evaluate the same n items. It fol- 
lows, therefore, that 


C+5N _ DUfR—5N 3) 
nN nN : 
whence, if r = n — R + 1 (i.e., when 
r = 1 is highest rank), 
C= Na->r, (4) 
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and if we substitute >> r = N#, then 
C = N(n — f) (5) 


which provides us with a means of 
predicting what the total number of 
choices for each item would have been 
in the method of paired comparisons 
from a knowledge of the mean ranks 
obtained by the method of rank order 
or, conversely, to predict mean ranks 
from the total number of choices in 
paired comparisons. 

The relationship stated in Equation 
5 may be arrived at on purely ra- 
tional grounds. Suppose all N judges 
ranked the same item as | (most pre- 
ferred), and suppose that in all paired 
comparisons they consistently chose 
this item over all (n — 1) alternatives. 
Then each judge would choose this 
item in (m — 1) comparisons and the 
total number of choices would be 


C = N(n — 1). (6) 


Similarly, if another item were 
ranked last, or nth, by all N judges, 
its mean rank would be n. In all of 
the paired comparisons involving this 
item, one of the other (m — 1) items 
would be chosen, so that the total 
number of choices for this particular 
item would be zero. 

We now have two points of refer- 
ence as follows 


=1, C=N(n-1) 
n, C=0 


and if we assume that a straight line 
joins these points, we arrive at the 
relationship 


C = N(n — #), 


which is the same as Equation 5. 

This equation states a relationship 
which must apply to the raw data of 
both of the methods and provides the 
basis for Hevner’s demonstration of 
the equivalence of the scale values 
obtained from them. It now remains 
to be shown that the relationship 
stated in Equations 5 and 7 actually 
applies to experimental data. 


i 
| 


’ 


(7) 
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TABLE 1 


Tota. NumBer or Cuorces (C) FROM THE 
Metuop or Partrep CoMPARISONS AND 
Mean Rank (7) From THE METHOD 
or Rank Orpver For 67 JupGES 
ARRANGING 11 ITEMs 














n Obtained Derived Obtained Deives 

Cc from C r from 7 
Sy 447 433 | 4.20 | 456 
GO 421 4.72 4.88 410 
LO 448 4.32 4.35 446 
SQ 194 8.11 8.07 196 
PS 255 7.20 7.02 267 
VS 151 8.75 8.71 153 
OS 171 8.45 8.13 192 
MC 534 3.02 3.67 491 
BB 212 7.84 7.67 222 
JO 374 5.42 5.43 373 
SO 478 3.87 3.82 481 














Two sets of empirical data are presented. 
The first set, shown in Columns 2 and 4 of Table 
1, was obtained from 67 university students (40 
men and 27 women) who were asked to indicate 
their preferences for 11 different kinds of music 
(symphony, grand opera, light opera, string 
quartet, piano solo, violin solo, organ solo, 
musical comedy, brass band, jazz orchestra, and 
swing orchestra) first by the method of paired 
comparisons and then by the method of rank 
order. This was accomplished by listing the 
55 comparisons on a mimeographed sheet in 
optimal order (5) in the form 


—Symphony Orchestra vs. Jazz Orchestra 
——Light Opera vs. Swing Orchestra 








and requiring that the preferred member of each 
pair be indicated by a checkmark. The Ss were 
then instructed to turn the page and arrange 
the 11 items in rank order with 1 indicating the 
most preferred and 11 the least preferred 
position. 

Column 2 of Table 1 shows the total number 
of choices (C) obtained from the method of 
paired comparisons and Column 4 shows the 
mean ranks (#) obtained from the method of 
rank order. 

These two groups of data are plotted against 
each other in the upper line of Fig. 1. The line 
drawn through these points is Equation 5 with 
N =67andn= 11. (Note that the x-intercept 
is n, the number of items, and the y-intercept is 
nN, the denominator of Equations 1 and 2.) 

The approximation of the empirical data to 
the requirements of Equation 5 is sufficiently 
good to justify accepting the equation as a 
reasonable statement of the relationship existing 
between them. 

A second set of empirical data is presented in 
Table 2. By the same procedure as that out- 
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Fe. 1. Total number of choices obtained 
from the method of paired comparisons plotted 
against mean rank obtained from the method 
of rank order. Upper series for 67 judges 
arranging 11 items; lower series for 57 judges 
arranging 6 items. Curves are obtained from 


Equation 5. 


lined above, 57 men were asked to express their 
preference for Masculine Men, Average Men, 
Effeminate Men, Masculine Women, Average 
Women, and Effeminate Women, with the dif- 
ference, however, that a month intervened 
between the administration of the methods of 
paired comparisons and rank order. The total 
number of choices (C) and the mean ranks (#) 
are shown in Columns 2 and 4, respectively, of 
Table 2, and are plotted against each other in 


TABLE 2 


Totat NumsBer or Cuorces (C) FROM THE 
Metuop of Parrep CoMPARISONS AND 
Mean Rank (f) FRoM THE MeETHOD 
oF Rank Orper For 57 JupcEs 
ARRANGING S1x Items 








. ? Cc 
n Chtgined Derived Congas Derived 
from C from f 





MM 211 2.30 2.36 207 
AM 252 1.58 1.50 255 


EM 21 5.63 5.83 10 
MW 94 4.35 4.29 97 
AW 189 2.68 2.62 193 
EW 88 4.46 4.36 . 94 

















the lower line of Fig. 1. The line drawn through 
the points is Equation 5 with N = 57 and 
n= 6. Again the approximation otf the theo- 
retical relationship to the raw data is satis- 
factory. 

By the use of Equation 5 it is possible, of 
course, to transmute total choices to rank orders 
and vice versa. Columns 3 and 5, respectively, 
in both tables indicate mean ranks derived from 
total choices and total choices derived from mean 
ranks. The closeness of fit of the data to the 
theoretical relationship may be observed by 
comparing the obtained and derived values of 
(C) and (#) in these two Tables. Equation 4 
can be used in this fashion when it is desired 
to obtain mean ranks from paired comparisons, 
especially in those cases where scale values are 
undesirable because of the doubtful normality 
of the distribution of judgments. 


SUMMARY 


It has been shown, both rationally and 
empirically, than when N judges are asked to 
indicate their preferences for m items by both 
the method of paired comparisons and the 
method of rank order, a linear relationship of 
the form 


C = N(n—#) 


will be found to hold between the total number 
of choices (C) from the paired comparisons, and 
the mean rank (#) from the rank orders. 

Inasmuch as this relationship holds, the pro- 
portions and scale values arrived at by the 
application of Guilford’s Composite Standard 
Method to paired comparisons and rank orders 
will be identical. The relationship is also 
implicit in Hevner’s procedure by which she 
demonstrated the equivalence of scale values 
arrived at from the method of paired com- 
parisons and rank order. 
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GENERALIZATION OF REINFORCEMENT AMONG 
SIMILAR RESPONSES 
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MADE IN ALTERED 


MELVIN H. MARX AND BENJAMIN B. BERNSTEIN 


University of Missouri 


Research on the spread of the in- 
fluence of reward has been confined 
until recently to the investigation of 
stimulus-response relationships of the 
ordinal type described in the typical 
spread-of-effect design. In a recent 
study by Marx and Bernstein (4) 
results were obtained, however, which 
indicated that generalization of rein- 
forcement may occur to stimulus- 
response connections not in ordinal 
proximity to reward. It was found 
that when the response to one of four 
synonyms in a list of words was 
rewarded, there occurred a strength- 
ening of the nonrewarded responses 
made to the three other synonyms 
independently of their ordinal posi- 
tions. This result was predicted on 
the basis of the Marx and Bunch 
generalization-of-reinforcement _hy- 
pothesis (5), which states that rein- 
response has a 
strengthening influence upon those 
other responses that are most closely 
related, along various dimensions, to 
the reinforced response. 

The present experiment was de- 


1 This research was supported in part by the 
United States Air Force under Contract No. 
AF 33(038)-25631, monitored by the Perceptual 
and Motor Skills Research Laboratory, Human 
Resources Research Center. Permission is 
granted for reproduction, translation, publica- 
tion, use, and disposition in whole and in part, 
by or for the United States Government. The 
opinions or conclusions contained in this report 
are those of the authors and are not to be con- 
strued as reflecting the views or indorsement of 
the Department of the Air Force. The co- 
operation of the Army ROTC unit at the 
University of Missouri in supplying Ss for this 
experiment is gratefully acknowledged. 


signed to test another implication 
derived from the generalization-of- 
reinforcement hypothesis. The spe- 
cific prediction made was that reward 
of the selection of one stimulus from a 
set of differing stimuli would result in 
an increased tendency toward repeti- 
tion of a second, nonrewarded selec- 
tion of one of a number of slightly 
varied forms of the particular kind 
of stimulus first selected. In other 
words, there would be a spread or 
generalization of reinforcement from 
the first rewarded response to a non- 
rewarded response related to it on the 
basis of stimulus similarity. 


MeTHOD 


Subjects—The Ss were 210 male college 
students naive as to the purpose of the experi- 
ment. They were obtained from the freshman 
and sophomore classes of the Army ROTC unit 
at the University of Missouri. An additional 
set of 41 similar Ss was run in a supplementary 
experiment to test certain hypotheses suggested 
as alternative explanations of the major results. 
The special conditions under which these Ss were 
run are described along with the results. 

A pparatus.—An automatic LaBelle 2 X 2-in. 
slide projector was used to present line drawings 
of the stimulus material. A Hunter decade timer 
was used to indicate the time intervals for slide 
presentation. The Ss were seated along two 
tables which were partitioned by shields so that 
they could not observe each other’s responses. 
Four Ss were placed along a table. Each S 
responded by pressing one of five push buttons 
on a box that was before him. The buttons 
were arranged linearly to correspond to five test 
items similarly projected on the screen, thus 
permitting S to select one of the items by pushing 
the appropriate button, i.e., the one in the 
corresponding spatial position. A bank of red 
lights was on a panel just above the push 
buttons; all of the lights flashed on to indicate a 
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correct response. The Ss’ responses were 
recorded by two 20-pen Esterline-Angus Opera- 
tions Recorders, one for each table of Ss. 

Experimental design.—The general plan of the 
experiment was to reward each S following his 
selection of one of five stimulus items early in the 
test series, and then to measure later in the test 
series the strength of his nonrewarded response 
to a slide showing five variations of the stimulus 
item which had been first selected and rewarded. 
It was predicted that the mean difference 
between first- and second-trial nonrewarded 
responses for such experimental Ss would be less 
than the.corresponding difference for control Ss 
whose original selection of one of these same 
five stimulus items was not rewarded. 

In order to provide a single first-trial rein- 
forcement for control Ss and also to lengthen the 
test series, two complete sets of five different 
stimulus items were used. The first set con- 
sisted of gauge forms. The Ss whose response 
to one of these was rewarded were in Group GR 
(gauge rewarded). ‘The second set consisted of 
island shapes. The Ss whose response to one of 
these was rewarded were in Group IR (island 
rewarded). IR Ss not only provided controls 
for the GR Ss but also, since they were given a 
comparable island reinforcement, themselves 
constituted an additional set of experimental Ss. 
Their controls were provided by the GR Ss. 

A regular graded scale from low to high 
readings provided the variations within each of 
the five gauge forms. An increasing amount of 
the gauge face was blackened to indicate pro- 
gressively higher readings. Because of the 
irregularities in the island shapes, no such regular 
relationship was possible for them. Instead, a 
focal point, said to represent a military in- 
stallation, was placed on each island form. This 
shifted, roughly, in a left to right direction for 
successive representations of the same island. 
Since the test-item variations for the gauges 
were more regular and discriminable, it was 
predicted that the experimental-control differ- 
ences would be more marked for Group GR. 

Differentiation among the five basic stimulus 
items of each set was increased by having Ss 
memorize the names for each gauge and island 
before the test series was presented. 

General procedure.—Six to eight Ss were 
trained and tested at one time. Sets of Ss 
were assigned at random to one of the two 
groups. Preliminary instructions were given to 
orient S to the experimental situation. For 
purposes of motivation the task was described 
in terms of military operations. It was ex- 
plained that military personne! often must 
remember names of objects and particular details 
about such objects for communication’ at some 
later time. The Ss were told that the experi- 
ment was designed to test their ability to re- 
member these kinds of information. 


Memorization trials—Study slides were pre- 
sented with the correct name of the gauge or 
island beneath each form on the slide. Artificial 
names were used for islands (Pregas, Fortega, 
Kialo, Rokoj, Ankau). Common gauge names 
were used for gauges (Gas, Oil, Heat, Time, 
Speed). 

Following this, the separate items were 
presented one at a time with all five names. 
Each S was to choose the correct name for each 
item by pushing the button beneath it. If S 
made a correct choice, the bank of red lights 
flashed on. This complete cycle of study slides 
followed by the separate item slides was pre- 
sented four times for both islands and gauges. 

Test trials —Following the training task, Ss 
were given instructions for the test trials. As 
an introduction they were told that occasions 
arose in which more detailed information had to 
be remembered, such as the location of an army 
base on an island, or a particular reading on a 
gauge. It was emphasized that now they had 
to learn the correct answers as they went along, 
starting from no information and gradually 
building up knowledge about the slides as they 
were given “right” and “wrong” information 
following their responses. It was explained that 
the right answers had been assigned at random, 
and not according to any set pattern or system. 

A trial consisted of 12 slides, each remaining 
on for 8 sec. For each slide S had to make a 
choice from five possible answers. As before, 
the red lights were used to inform S when his 
answer was the correct one. 

Each slide projected line drawings of five 
objects. The S’s task was to pick out the 
particular object he thought might be the 
correct one and respond by pressing the ap- 
propriate button. Two kinds of slides were 
employed: (a) Master slides: These showed all 
of the five gauges, or islands, which were the 
same objects on which the Ss were trained. The 
first two slides of each trial were master slides. 
(b) Detail slides: These 10 slides showed each 
gauge, or island, in the five variations of its 
basic form (five degrees of gauge blackening, or 
five varying focal points on islands). 

The island and gauge slides were presented 
alternately. For Group GR, the island master 
slide preceded the gauge master slide, and island 
detail slides preceded gauge detail slides in pairs 
throughout the list. For Group IR the reverse 
sequence was used. The ordering of the 10 
detail slides was systematically rotated for 
different sets of Ss to control for position effects. 
Four trials were given by repeating the same list 
of 12 slides. A gray rest slide was presented for 
16 sec. between trials. The Ss were told that 
their score would be the total number of correct 
answers they gave. 

Only a single reinforcement was given in each 
of the first two trials. The reinforcement was 
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GENERALIZATION OF REINFORCEMENT 


administered to any response made on the second 
master slide in the list. The last two trials 
were not analyzed but were given with two 
additional reinforcements in order to avoid 
arousing the suspicion of the Ss. 

Analysis of data.—Strength of response was 
measured by the numerical difference between 
the push-button responses made to the same slide 
on the first and second trials (cf. 3, 4,6). The 
push buttons werc considered to be numbered 
from 1 to 5, from left to right across the response 
box. A strong response was therefore one which 
was minimally varied in ordinal number units. 

Thus, if a response of 3 were given in the 
first trial, a response of 2 or 4 tp the same 
stimulus on the second trial (represented by a 
response-difference score of 1) was considered as 
evidence of greater strength than a response of 
1 or 5 on the second trial (represented by a 
response-difference score of 2). A repetition of 
the response originally given, of course, repre- 
sented maximum strength. The response varia- 
tion score was in that case zero. Response- 
variation scores of increasing magnitude thus 
indicated decreasing degrees of response strength. 

In the present study a response-difference 
score was obtained for each S from the first- and 
second-trial responses made to the slide present- 
ing the five readings on the gauge which had been 
chosen, and reinforced, on the second master 
slide. The same kind of measure was obtained 
when the response to an island shape had been 
reinforced. This measure will be referred to as 
an E (experimental) score. A difference score 
for each S was also obtained from the responses 
to that slide presenting the stimulus which had 
been chosen, with responses not reinforced, on 
the first master slide. This measure will be 
- referred to as the C (control) score. Each S 
thus served as an experimental S for one category 
and a control S for the other.” 

The data for a number of Ss were eliminated 
from the major analyses for gross defects 
produced by failure to follow instructions. 
Obvious sequences and too many or too few 
responses were the common flaws. Thirty-seven 
of the original 149 Ss from Group GR and 26 of 
the 124 Ss in Group IR were eliminated. The 
x? for the difference in proportion of these Ss 
was 1.59 (3.84 required at the .05 level). 

In order to obtain a more unambiguous test 
of the hypothesis, records were also eliminated 
if S failed to repeat the reinforced master-slide 
response on the second trial. These could not 





2An exception to this generalization was 
made for seven Ss of the 210 who provided 
usable records. These seven Ss could not be 
used for control purposes because they failed to 
make the first-trial first-slide response which 
was used to determine the critical detail slide 
for control scoring. Their records were other- 
wise satisfactory and so they were not discarded. 
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TABLE 1 4 
Mean Intertrtat Response-DiFFERENCE 
Scores 
Group N Mean SD t 
E 112 1.55 | 1.11 
GR 2.69** 
Cc 93 1.98 | 1.18 
E 98 1.70 | 1.27 
IR 65 
Cc 110 1.81 | 1.12 
E 210 1.62 | 1.18 
GR+I1R 2.45* 
Cc 203¢ | 1.89 | 1.15 
*P = 02. 
*P = O01. 


t See footnote 2 for an explanation of this dis- 
crepancy in the control N. 


easily be used, since the critical detail slide could 
not be determined when two different master 
slides had been reinforced. Twenty-four Ss 
were eliminated from Group GR and 20 from 
Group IR for thus failing to repeat. 


RESULTS 


Major E/C analyses—The mean 
response-difference scores for the 
critical detail slide are shown in 
Table 1. The mean E score for 
Group GR was significantly lower, 
indicating greater response strength, 
than the mean C score. The differ- 
ence for Group IR was also in th¢ 
predicted direction but was not 
statistically significant. This more 
clear-cut E/C difference for the GR ° 
Group had been predicted on the basis 
of the greater regularity of the re- 
sponse scale on the critical detail slide 
for that group. 

The distributions of scores were not 
skewed enough to distort results ob- 
tained from the ¢ or F test. The 
variances did not differ significantly. * 

Combination of Groups GR and IR 
gave a mean E/C difference which 
was significant at the .02 level. Since 
the within-Ss correlation between E 
and C scores in both groups was not 
significant and the groups to be com- 
bined did not differ significantly, the 
unmatched ¢ test was applied. Al- 
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though a precise reference to parent 
population cannot be made, the com- 
binations of groups are presented to 
provide some picture of the over-all 
effect of the experimental variable. 
Any question concerning these com- 
binations does not apply to the in- 
dividual group ¢ tests. 

If the E scores tended to distribute 
differently from the C scores on the 
first trial, a chance distribution of 
responses on the second trial might 
have resulted in differences between 
group means. Therefore, the dis- 
tributions of first-trial responses for 
the two groups were plotted. The 
distributions were all essentially rec- 
tangular. The difference scores to be 
expected, given a chance rectangular 
distribution on the second trial, were 
calculated for the four groups. The 
range was only from 1.56 to 1.58. 
This negligible difference indicates 
that the obtained mean differences 
presented in Table 1 cannot be at- 
tributed to any differences in the 
distributions of first-trial responses. 

Exact repetition of response was 
also used as a measure of response 
strength. Both groups showed non- 
significant differences in the predicted 
direction (16% to 9% exact repeti- 
tions for GR, and 18% to 10% for 
IR). 

The data from Ss who failed to 
repeat the rewarded master slide were 
checked. There was a consistently 
greater proportion of exact error 
repetition in the eliminated Ss. The 
mean response-difference scores were 
also lower. These results suggest 
that these Ss constituted a somewhat 
different population, perhaps inferior 
in motivation and/or ability, since 
they avoided their errors to a lesser 
degree. In any case, all the E/C 
differences, although small, were in 
the predicted direction. 

Control of intratrial generalization 
effects—If there was a closer relation- 
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ship in the experimental S on each 
trial between the reinforced master- 
slide response and the erroneous 
response made on the critical detail 
slide, then the apparent intertrial 
response strengthening could well be 
an artifact of intratrial generalization 
effects. This generalization from 
master slide to detail slide could have 
occurred along two dimensions. First, 
the degree of blackening of the various 
gauges tended to differentiate them 
to varying degrees from the wholly 
unblackened master-slide gauges. 
Differentiation of the island shapes 
by positioning of the focal point did 
not produce this kind of gradient. 
Second, the spatial relations among 
the five push buttons offered a linear 
physical dimension for response gen- 
eralization for both GR and IR Ss. 

Frequency-distribution analysis of 
the GR data provided little support 
for generalization along the blackening 
dimension. There was actually a 
tendency for fewer Ss to choose the 
least blackened gauges: n’s of 12, 21, 
27, 29, and 23, from least to most 
blackened gauges. This compares 
with the control distribution of 19, 
19, 14, 26, and 15 Ss. 

The experimental GR Ss did show 
evidence of intratrial response gen- 
eralization along the push-button di- 
mension. Differences were computed 
between the master-slide button re- 
sponse and the subsequent critical 
detail-slide button response. The 
difference between the E and C dis- 
tributions of difference scores was 
significant at just short of the .05 
level (x? = 7.46 for 3 df; 7.82 re- 
quired). The experimental IR Ss 
showed a similar trend, but their 
distribution was not significantly dif- 
ferent from that of their controls 
(x? = 3.34 for 3 df). 

Stronger intertrial response-differ- 
ence mean scores were found for the 
Ss with least intratrial variation: 
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1.41, 1.40, 1.71, and 2.08, for 0, 1, 2, 
and 3-4 difference classes. No gra- 
dient was found for their controls: 
2.13, 1.70, 2.11, and 2.32, for com- 
parable positions. At each point the 
experimental mean was stronger. 

Partial correlations between first- 
and second-trial detail-slide responses 
are presented in Table 2. There was 
a significant intertrial error correlation 
with the relationship between rein- 
forced-response button position and 
critical detail-slide error buttdn posi- 
tion partialled out. This result in- 
dicates that this kind of intratrial 
generalization was not sufficient to 
account for the intertrial strength- 
ening. 

One further check was made. The 
12 experimental GR Ss who had 
selected the most similar (least black- 
ened) of the differentiated gauges 
were compared with the 19 control 
Ss who selected this gauge. These 
experimental Ss showed an unexpect- 


TABLE 2 


INTERTRIAL CORRELATIONS FOR RESPONSES 
on Criticat Detar, SLIDE wiTH 
RELATIONSHIP TO REINFORCED 
Response PARTIALLED Out 





ri2 RR CR 


Group N | r 


| 





Main Experiment 





E 112 06 05 

GR 2.69** 
Cc 93 —.27 | —.27 
E 98 — .05 — .05 

IR 71 
Cc 110 —.15 —.15 





Supplementary Experiment 























E 20 59 62 

GR 2.10* 
Cc 21 —03 | —.03 
E 21 .28 .24 

IR —_ 
Cc 20 .23 30 

*P = 05. _ 
** P = 02. 
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edly low intertrial mean difference 
score (1.00) compared with the high 
control score (2.58). The obtained 
difference was highly significant 
(¢ = 3.51, P< .0l). Both mean 
scores should have been high, since the 
least blackened gauge corresponded 
to Button 1. Intertrial differences as 
high as 4 were possible when the re- 
sponse on the first trial was to Buttons 
1 or 5. The comparable IR mean 
scores were both high (1.81 for 21 
experimental Ss, 1.91 for 23 controls). 

When these 12 experimental Ss and 
the 19 comparable controls were 
eliminated from the analyses, both the 
response-difference measure and the 
partial-correlation measure still fa- 
vored the experimental GR Ss, but 
failed of statistical significance. 


Supplementary experiment.—The relatively 
low intertrial correlations for experimental Ss 
(cf. Table 2) indicate that with only 12 stimulus 
slides in the series the well-motivated Ss were 
able to remember and thus to avoid errors. This 
suggests an alternative explanation of the results. 
It is that the errors were simply less well re- 
membered and thus less well avoided in the 
experimental groups. This hypothesis assumes 
that there was no true strengthening of an S-R 
relationship in the experimental group, but 
rather a weakening of retention of the errors. 

To test this hypothesis, the main experiment 
first trial was duplicated with more ROTC Ss. 
On the second trial, Ss were told to reproduce, as 
exactly as possible, all their first-trial responses. 
It was emphasized that the new instructions 
applied to errors as well as to correct responses. 
This follows the general procedure introduced 
by Nuttin (7). 

Superior rather than inferior retention was 
found for the experimental condition. The 
mean intertrial difference score for the experi- 
mental critical detail slide for the 20 GR Ss who 
repeated the rewarded response was .65, com- 
pared with 1.14 for their 21 controls. The 
corresponding value for the 21 experimental IR 
Ss was .95, compared with 1.45 for the 20 
controls. 


It may be assumed that a certain 
amount of guesswork was involved in 
the attempt to reproduce each of the 
first-trial errors. In this process any 
differential first-trial strengthening of 
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error tendencies as a function of 
reinforcement would make for a 
greater tendency towards repetition 
even under the new instructions. 
Strong statistical support for the inter- 
trial strengthening interpretation is 
provided by the greater partial r for 
GR Ss, shown in the lower part of 
Table 2, since this controlled for any 
influence of intratrial generalization 
mediated along the push-button di- 
mension. Moreover, this result can- 
not be attributed to the influence of 
intratrial generalization along the 
blackening dimension. Only two GR 
Ss selected the least blackened gauge 
on the first-trial critical detail slide, 
and their mean intertrial difference 
score (1.00) was above the group 
mean (.65). 


Discussion 


The fact that the stimulus situation 
was changed from master slide to detail 
slide makes any simple application of 
response generalization insufficient as an 
explanation of the intertrial relationships. 
Stimulus generalization is similarly in- 
adequate, since the E/C difference in 
number of exact repetitions was not 
significant. However, Hull’s suggestion 
of a combination of stimulus amd response 
generalization (1, p. 183) could explain 
the results. Such S-R_ generalization 
clearly overlaps with the generalization- 
of-reinforcement principle, particularly 
if stimulus mediation is explicitly in- 
volved, as in the present design. 

However interpreted, the present re- 
sults point up the need for analysis of 
complex stimulus and response gener- 
alization effects resulting from reinforce- 
ment. No empirical investigation has 
previously been directed toward the S-R 
generalization suggestion made by Hull. 
Moreover, this suggestion has received 
little theoretical attention, or even ad hoc 
utilization. As a matter of fact, it is not 
explicitly dealt with in Hull’s latest 
work (2). 

SUMMARY 


An experiment designed to test the Marx and 
Bunch (5) generalization-of-reinforcement hy- 


pothesis is reported. The Ss were first trained 
to discriminate among related stimulus items 
presented by a slide projector. Their response 
to one of five such forms was then reinforced, 
and the strength of a response to five variations 
of the reinforced stimulus was later measured. 
A reciprocal experimental-control design was 
used whereby each S served both as an experi- 
mental § for one set of stimulus items and a 
control for the other. 

Significantly stronger experimental responses 
occurred for the stimulus items (gauge forms) 
for which a more regular scale of stimulus 
variation was used. The second set of items 
(island shapes) also showed a difference in the 
predicted direction, but it was short of statistical 
significance. 

Control analyses suggested that the major 
experimental-control differences could not be 
explained as artifacts of intratrial generalization. 
A control experiment showed that the results 
were not due to a weakening of retention of 
errors due to the experimental treatment. 

The results lend some further support to the 
generalization-of-reinforcement hypothesis, or to 
the Hullian suggestion of stimulus-response 
generalization (1). Taken in conjunction with 
the positive results previously obtained (4), they 
suggest that spread or generalization of the 
influence of reinforcement need not be restricted 
to relations of ordinal proximity. 
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MEANINGFULNESS AND ISOLATION AS FACTORS 
IN VERBAL LEARNING 


GREGORY A. KIMBLE AND ROBERT H. DUFORT?! 


Duke University 


This series of experiments began as 
a study of the relationship between 
speed of learning and meaningfulness, 
with the intention of sampling this 
function for a wider range of mean- 
ingfulness values than had previously 
been employed. The results of the 
first experiment, however, suggested 
very strongly that the relationship 
could not be studied in an uncom- 
plicated way using the procedures 
employed there, because of an un- 
anticipated effect of meaningfulness. 
Experiments II and III examine this 
complicating factor more closely. 


EXPERIMENT [| 


The purpose of Exp. I was to obtain 
functions relating speed of learning to 
meaningfulness, using the paired- 
associate learning procedure. This 
method has the advantage of making 
it possible to include in a single set of 
learning materials items of varying 
degrees of meaningfulness, so that the 
complete function can be investigated 
on a single group of Ss. The experi- 
ment was performed in two replica- 
tions which will be referred to as IA 
and IB. Except for the learning ma- 
terials, they were identical. 


Method 


Learning materials—For each of the two 
replications, ten pairs of terms were selected 
from Noble’s (4) list on the basis of the following 
criteria: (a) They covered the complete range of 
m (meaningfulness) values. (b) Only words of 


1 We should like to thank Mr. James Hall for 
collecting the data for Exp. IA, and Mr. P. J. 
Baugh and Mr. William Rudge for running 
Exp. III. 


nearly the same m value were paired. (c) Ex- 
cept for two pairs of paralogs (pronounceable 
nonsense words) of nearly the same m value, an 
attempt was made to obtain a series of roughly 
equal steps along the meaningfulness continuum. 
(d) The lists for the two replications were 
different. The three words in common were 
used to make it possible to sample the entire 
meaningfulness continuum in each list. The 
pairs of terms and their m values appear in 
Table 1. 

These pairs of words were typed on memory- 
drum tapes and presented to Ss in six different 
haphazard orders on different trials. If S did 
not learn the list in six trials, the order in which 
the pairs were presented in Trials 7-12 was the 
same as that on Trials 1-6. Only two Ss in 
Exp. IA and one in Exp. IB required more than 
12 trials. For these Ss previous orders were 
repeated a second time. There was, however, 
no evidence that any Ss learned the order of the 
response terms. 

Subjects and procedure-—The Ss in each 
replication were 20 Duke University students. 
They received these instructions: “This is not a 
personality test of any sort. I am interested 
only in the words, not in the reactions of the 
subject. You will be shown a series of ten pairs 
of words. The first time the list is shown, you 
are to try to learn as many of the pairs as 
possible. Do not try to learn the order in which 
the items appear. After the list has been pre- 
sented once, it will be shown again: the same 
pairs but in different order. When you see the 
stimulus word, reply with the word paired with 
it in the first list. If you are not sure of the 
correct response, guess at it. The experiment 
will be continued until you get the entire list 
correct once.” ‘The Ss were not told that some 
of the items were not real words, nor were they 
told the pvrpose of the experiment. 

The words were presented by a Gerbrands 
memory drum which exposed the stimulus term 
for 3 sec. alone, and then presented the stimulus 
and response terms for 3 sec. together. The 
3-sec. exposure was selected on the basis of 
preliminary investigation as one which produced 
a minimum of difficulty in deciding whether the 
response was made soon enough to be counted 
as correct. 
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TABLE 1 


Pairs oF Worps AND THE m-VALUE or Eacu 











Experiment IA 


Experiment IB 








Stimulus m value Response m value Stimulus m value Response m value 
NEGLAN 1.04 | MEARDON 1.05 GOJEY 0.99 MEARDON 1.05 
TAROP 1.24 XYLEM 1.24 BYSSUS 1.13 NEGLAN 1.04 
FEMUR 2.09 | LOZENGE 2.09 FLOTSAM 2.19 | GRAPNEL 2.15 
PALLOR 3.06 | SEQUENCE 3.21 RAMPART 3.36 ARGON 3.34 
ORDEAL 3.91 ZENITH 4.44 QUOTA 4.68 YEOMAN 4.60 
QUARRY 5.10 | EFFORT 5.13 MALLET 5.61 KENNEL 5.52 
QUARTER 5.98 REGION e 5.98 UNCLE 6.57 ZERO 6.15 
ZEBRA 7.12 | GARMENT 7.17 JEWEL 7.58 INSECT 7.39 
WAGON 8.12 DINNER 8.33 OFFICE 7.95 HEAVEN 7.91 
ARMY 9.43 KITCHEN 9.61 MONEY 8.98 KITCHEN 9.61 


























Results and Discussion 


The results appear in Fig. 1 which 
presents median number of trials to 
the first correct association for each 
pair of words, for the two replications. 
The abscissa values are the average 
meaningfulness values of the two 
words in each pair. Medians are 
used as measures of learning because 
the distributions of the response meas- 
ures were systematically skewed. 
The two curves are similar, although 
that obtained in Exp. IB is less regular 
than that obtained in Exp. IA. By 
comparison with the results of previ- 
ous experiments on the relationship 
between speed of learning and mean- 
ingfulness, these curves differ chiefly 
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Fic. 1. Median number of trials to learn 
each pair of terms in Exp. IA and IB. The 
abscissa values are mean m value of each pair. 


at the low-meaningfulness end of the 
continuum. Earlier investigators 
(3,5) have obtained functions relating 
speed of learning to meaningfulness 
which were negatively accelerated and 
decreasing. In these previous ex- 
periments, nonsense materials were 
much more difficult to learn than any 
meaningful materials. The curves in 
Fig. 1, by contrast, show at most a 
slightly heightened difficulty of learn- 
ing one of the nonsense pairs. The 
others were learned more easily than 
several of the pairs of meaningful 
words. Presumably this difference 
between the present results and those 
of previous studies is somehow assign- 
able to the paired-associate procedure. 
Noble’s (5) results which were based 
on the same materials as those used 
here, but employing the serial learning 
technique, resemble the results of 
other serial learning studies more 
nearly than they do these. This 
suggests that the paired-associate 
procedure introduces a variable which 
makes the less meaningful materials 
easy to learn. 


One factor which might possibly act in 
this way is the isolation effect studied by 
von Restorff {6}. She found that an 


isolated terra such as a two-place number 
in a list of nonsense syllables was learned 
very easily. Her interpretation was in 
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terms of the figure-ground relationship 
stressed by gestalt psychologists. Ac- 
cording to this explanation, materials 
which are different from the context in 
which they are embedded stand out from 
that background and are learned easily. 
Just why such isolation should lead to 
easy learning is a matter which we shall 
discuss in connection with Exp. III. 
For our present purposes, the implication 
of this line of reasoning is that relatively 
meaningless materials stand out against 
the otherwise meaningful materials in 
lists of paired associates and that this 
isolation or contrast effect aids learning. 
Experiments II and III are a test of this 
general idea. 


ExpeERIMENT II 


The suggestion that a process like 
the one investigated by von Restorff 
is responsible for the precise form of 
the functions obtained in Exp. I 
implies the operation of a perceptual 
factor in rote learning. The purpose 
of this experiment was to test this 
implication. Specifically, it seemed 
likely that such a perceptual process 
would have a greater influence on the 
stimulus terms in paired-associate 
learning than on the response terms. 
Such reasoning leads to the conclusion 
‘that nonsense words in a paired- 
associate learning experiment would 
be easy to learn if these words were 
placed on the stimulus side, but that 
this effect would not appear when the 
same words are on the response side of 
the pair. 


Method 


Learning materials.—The list of paired asso- 
ciates learned by two different groups of Ss in 
this experiment appear in Table 2. It will be 
observed that each pair consists of a common 
three-letter word and a word selected from 
Noble’s list. Of the latter, the two terms, 
Xylem and Tarop, are paralogs and would 
presumably be meaningless or nearly so to all Ss. 
The remaining words are ones which almost all 
college students would at least recognize. The 
three-letter words are all of the consonant-vowel- 
consonant variety, and the vowel is always a. 


TABLE 2 


Partrs or Worps Usep 1n Exp. II 








Scaled Paired Scaled Paired 
Word Word Word Word 


TAROP MAN |} QUARRY | HAT 
XYLEM RAG REGION HAM 
FEMUR TAX ZEBRA WAR 
PALLOR CAT WAGON JAB 
ORDEAL | PAW ARMY BAY 




















This restriction upon the form of the words was 
based upon a preliminary experiment which was 
exactly like this one, except that the three-letter 
words were a heterogeneous collection without 
this systematic similarity. The results of the 
preliminary experiment were also like those of 
this one, but there was an unsatisfactorily large 
degree of variability in the data. The con- 
struction of this list represents an attempt to 
reduce such variability. 

In terms of the lists presented in Table 2, the 
prediction to which the hypothesis under ex- 
amination leads is that the two nonsense words, 
Tarop and Xylem, would be relatively easier to 
learn when they appear as stimylus words than 
when they are response words. f This prediction 
is on the assumption that the isolation phe- 
nomenon suggested by the results of Exp. I is a 
perceptual effect which would affect the stimulus 
terms more than the response weet 

Subjects and procedure.—The Ss were 28 Duke 
University students; 20 of them had served in 
Exp. IB. Since the results of this experiment 
were very similar to those obtained from naive Ss 
in the preliminary experiment, it seems unlikely 
that their previous practice at learning paired 
associates had any serious effect upon the out- 
come of this experiment. 

One group of 14 Ss learned the materials in 
Table 2 under an arrangement where the scaled 
words were the stimulus words. The other 
group of 14 Ss learned them with the scaled 
words as the response words. These two groups 
will be designated, respectively, as Group § 
(scaled words on the stimulus side), and Group 
R (scaled words on the response side). 

All procedural details were the same as in 
Exp. I. Since there were two groups in this 
experiment, care was taken to be sure that the 
order of pairs was the same on each trial for each 
group. ‘Thus any differences in learning must be 
attributed to the placement of the scaled words 
on the stimulus or on the response side of the list. 


Results and Discussion 


The hypothesis under investigation 
in this experiment was that the opera- 
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tion of a perceptual factor in paired- 
associate learning would make non- 
sense words relatively easy to learn 
when they were used as stimulus 
terms, but that no such benefit would 
appear when the same words were 
used as response terms. The results, 
in terms of number of trials to 
learn, confirm this prediction. For 
Group S, the mean number of trials 
to learn the two pairs containing 
meaningless words as stimuli was 
2.00; the mean number of trials to 
learn the pairs with meaningful words 
as stimuli was 2.36. For Group R, 
the mean number of trials required to 
learn the comparable pairs were 3.57 
and 2.59. In this group, of course, 
the words referred to above as stimuli 
were on the response side of the pair. 
On the assumption that most of the 
scaled words employed in this experi- 
ment are less meaningful than the 
three-letter words, a further implica- 
tion of these findings is that the effect 
of meaningfulness on learning may 
involve a response process as well as 
the stimulus one investigated directly 
in this experiment. The evidence for 
this is that both kinds of words were 
mastered more quickly by Ss in 
Group S. 

Both of these interpretations are 
supported by the results of an analysis 
of variance presented in Table 3. 
The significant F for experimental 
conditions supports the incidental ob- 
servation suggesting an effect of mean- 


TABLE 3 


ANALYSIS OF VARIANCE ON 
Resutts or Exp. II 























Source df — F p 
Conditions 1 | 11.52 | 10.87 .005 
Error between 26 1.06 — = 
Words 1 1.32 | 1.80 | >.20 
Interaction 1 6.17 | 8.45 | <.01 
Error within 26 73 — -~ 
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ingfulness through the response proc- 
ess. For the main purpose of the 
experiment, the critical F test is that 
for the interaction variance. The 
fact that it is significant shows that 
the difference between the number of 
trials to learn the two kinds of pairs 
was different from group to group. 
The sort of difference involved may 
be summarized by pointing out that, 
for Group S, the paralogs were easier 
to learn than the words; for Group R 
the words were easier to learn than 
the paralogs. 

The relevance of these results to the 
interpretation given for those of 
Exp. I is that they indicate that the 
isolating effect from which terms with 
little meaning benefit occurs only if 
these terms are used as stimuli. In 
this sense they support the implication 
that the isolating effect is a perceptual 
one which would presumably operate 
chiefly on stimuli. 


EXPERIMENT III 


Perhaps the best-known effect of 
isolation on verbal learning is its in- 
fluence on the serial position function. 
When an item different in kind from 
those constituting the rest of the 
serial list is inserted into the middle of 
the list, the usual bow-shaped form 
of the serial position curve is altered 
and a dip appears at the point where 
the isolated term occurs. The as- 
sumption that meaningfulness (really 
the lack of it) can serve to isolate a 
term leads to an interesting experi- 
mental prediction. If a nonsense 
term is put into the middle of a list of 
otherwise meaningful words, it should 
be learned easily. Furthermore, if 
degree of meaningfulness determines 
the degree of isolation, it should be 
possible to manipulate the magnitude 
of this effect by varying the degree of 
meaningfulness of the critical word. 
The interesting feature of this pre- 
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diction is that, in such an experiment, 
the speed of learning should vary 
inversely with degree of meaningful- 
ness. The least meaningful term 
should be the easiest to learn. This, 
of course, is exactly the opposite of the 
outcome of most experiments on the 
relationship between meaningfulness 
and learning. Experiment III tests 
this prediction for four degrees of 
meaningfulness. 


Method 


Learning materials—Four groups of Ss 
learned serial lists of 13 words which were iden- 
tical except for the middle word. For each 
group, 12 of the words were 12 of Noble’s 14 
most meaningful words. The order of these 12 
words in the list was: heaven, dinner, captain, 
kitchen, garment, jewel, village, wagon, army, zebra, 
money, insect. For the four groups, different 
words were inserted at the seventh position, 
following jewel. These words with their asso- 
ciated m values were: gojey (0.99), rampart 
(3.22), kennel (5.52), and office (7.95). The last 
word, office, is from the same meaningfulness 
range as the other 12 words. The others came 
from less meaningful portions of the continuum. 
Gojey is a paralog. 

Subjects and procedure.—The Ss were 80 Duke 
University undergraduates, in four groups, with 
15 men and 5 women in each group. Several 
additional Ss were discarded on the basis of two 
criteria: (a) They reported learning the list in a 
narrative fashion, and (b) they learned the list 
in five trials or fewer. 

Instructions to Ss were Yale (2) instructions 
modified chiefly by omitting the instruction not 
to make up fanciful associations among the 
words. Preliminary work indicated that this 
instruction produced exactly the opposite of the 
desired effect. 

Each S learned the list to one errorless repeti- 
tion. The words were exposed by a Gerbrands 
memory drum at a 3-sec. rate. Lists (trials) 
were separated by 5 sec. 


Results 


The purpose of this experiment was 
to determine the form of serial posi- 
tion curve for lists which are identical 
except for the middle word and, with 
the same exception, fairly homo- 
geneous with respect to the meaning- 
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Fic. 2. Mean number of total errors as a 
function of serial position. 


fulness of the individual items. One 
such presentation of the results ap- 
pears in Fig. 2.—fhe plot there is in 
terms of oe number of errors 
at each position in the series. Figure 
3 presents the same relationship in 
terms of mean number of intrusion 
errors. Each of these presentations 
shows the dip predicted for the curve 
representing the data for the list 
containing the nonsense word, gojey, 
and each shows that the word, ram- 
part, which is next lowest on the 
meaningfulness continuum was next 
easiest to learn. In terms of total 
number of errors, kennel was slightly 
harder to learn than office which 
represents an inversion of the order 
predicted. In terms of intrusion 
errors, however, there is a regular 
increase from gojey which produced 
the fewest to office which produced the 
most. Analysis of variance reveals 
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Fic. 3. Mean number of intrusion errors as a 
function of serial position. 
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that the relative difficulty of learning 
the critical word differed significantly 
from group tofgroup (F = 9.01; 
~ < .001). This’ analysis was per- 
formed on the percentage of total 
errors occurring in the critical posi- 
tion. Such treatment of the data 
minimizes the effect of initial differ- 
ences in learning ability. 

Further examination of the func- 
tions presented in either Fig. 2 or 3 
shows that the word used as the 
middle term apparently had some 
effect on the ease with which several 
of the immediately preceding and 
following words were learned. This 
influence is most obvious for the list 
containing gojey in which the adjacent 
words show a marked rise in number 
of errors. Detailed inspection of the 
records of individual Ss provided no 
satisfactory explanation for such an 
effect. 


Discussion 


The results of these three experiments 
leave little room for doubt that an isola- 
tion factor benefits the learning of non- 
sense materials when they are presented 
among more meaningful materials. In 
Exp. I this effect produced a flattening 
of the curve at the low-meaningfulness 
end of the continuum. In Exp. II it 
was demonstrated that the effect is 
chiefly on the stimulus terms, which is 
compatible with the idea that the process 
is a perceptual one. Experiment III 
provided additional support for such an 
interpretation by demonstrating, as pre- 
dicted, that the ease of learning the 
middle word in a list of highly meaningful 
words varies inversely with the meaning- 
fulness of that term. 

Results such as those just summarized 
fit readily into the framework of gestalt 
theorizing and pose a problem for the 
more conventional S-R point of view. 
For S-R theory, the problem is to reduce 
the isolation phenomenon to S-R con- 
cepts. Gibson (1) has proposed a theory 
which will do this for the results of 


GREGORY A. KIMBLE AND ROBERT H. DUFORT 


Exp. I and II, which involved paired- 
associate learning. With some additions 
the same ideas will handle the results of 
Exp. III. Gibson’s approach to paired- 
associate learning treats the situation as 
one in which the stimulus terms in the 
list have the same properties as other 
stimuli and the response terms have the 
same properties as other responses. On 
this general assumption, Gibson then 
proceeds to the hypothesis that the 
response terms will sometimes be evoked 
by stimulus terms other than those to 
which they are correct as a result of 
stimulus generalization. To the extent 
that the various stimulus terms in a 
paired-associate list are similar, every 
stimulus word will tend to attract, as 
responses, words from other parts of the 
list. Such responses will sometimes 
occur as overt errors; at other times they 
will merely inhibit the correct response 
which also tendstooccur. The difficulty 
of learning any pair of words will, by this 
reasoning, vary directly with the number 
of competing associations the stimulus 
word attracts. Since the process re- 
sponsible for such attraction is stimulus 
generalization, stimulus terms which 
differ markedly from those in the rest of 
the list will attract few associations and 
be learned with relative ease. To ac- 
count in these terms for the ease with 
which the relatively meaningless items 
were learned in Exp. I and II, it is 
necessary to make two assumptions: 
(a) That meaningfulness is an attribute 
that applies to the stimulus aspect of 
words, so that it can provide a dimension 
along which stimulus generalization can 
operate. The procedures used for scal- 
ing meaningfulness and the results of 
Exp. II both support this assumption. 
(6) That the terms with very low degrees 
of meaning differ more on this dimension 
from several of the more meaningful 
terms than these latter terms differ 
among themselves. Then it follows that 
paralogs, for example, will receive rela- 
tively few competing associations from 
other items in the list, and will be learned 
rapidly. 

The extension of such reasoning to the 
results of Exp. III poses an interesting 
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problem. In that experiment, both total 
number of errors and number of in- 
trusion errors at the critical point in the 
list were smallest when the correct re- 
sponse was a nonsense word, and greater 
when the correct response was a mean- 
ingful word. The results, thus, confirm 
those of the previous two experiments in 
showing that, under certain circum- 
stances, an item with little meaning can 
be learned easily. The problem is one of 
interpretation. In the previous experi- 
ments, it was possible to attribute the 
differences in ease of learning to differ- 
ences in the various stimulus words 
involved in different pairs of terms. In 
this experiment, however, the stimulus 
word was the same for each group, the 
word jewel. The problem is to explain 
how, in the different groups, the word 
jewel could have attracted different 
numbers of associations leading to differ- 
ent numbers of errors, and how the 
number of such associations could have 
been determined by the meaningfulness 
of a response word which did not occur. 

About the only plausible solution to 
this problem appears to be in the assump- 
tion that the correct response did actually 
occur in some implicit or covert form and 
that, functioning as a stimulus, it called 
up associations which interfered with its 
own appearance. The data provide evi- 
dence in favor of part of this assumption 
in the fact that the results of Exp. III 
are much the same whether they are 
expressed in terms of intrusion errors or 
in terms of total errors. This supports 
the contention that the results are ex- 
plainable in terms of competing asso- 
ciations. 

There remains the problem of account- 
ing for the dependence of such competing 
associations upon the meaningfulness of 
the corre@t response. Two possibilities 
suggest themselves: (a) A meaningful 
word like kitchen, by the very definition 
of meaningfulness, possesses more asso- 
ciative bonds than a meaningless word 
like gojey. It might be that these previ- 
ously acquired associations are called up 
and that they interfere with the overt 
appearance of the correct response. 


(4) Following Gibson’s hypothesis, a 
meaningful word like kitchen would be 
expected to attract more generalized as- 
sociations from within the list because of 
its similarity to other items. Such evi- 
dence as comes from the experiment 
supports the latter interpretation. 
About 98% of the intrusion errors ob- 
tained from Ss in Exp. III were responses 
which would have been correct at some 
other point in the list. If the associa- 
tions which § brings to the experiment 
with him were important, more extrane- 
ous intrusion errors should have occurred. 


SUMMARY 


Three experiments on the relationship be- 
tween meaningfulness and speed of learning are 
reported. 

In Exp. I, 40 college students learned paired- 
associate lists in which different pairs of terms 
were of different degrees of meaningfulness. 
The results of this experiment differed from those 
obtained in previous experiments chiefly in that 
the terms with least meaning were much easier 
to learn than would have been predicted from 
earlier research. This suggested the operation 
of the isolation factor studied by the gestalt 
psychologists. 

Experiment II was performed to determine 
whether the isolation effect would be greater 
when the isolated nonsense terms were stimulus 
terms, as compared with response terms. ‘Two 
groups of 14 Ss learned paired-associate lists in 
which one term was a very common three-letter 
word and the other was a disyllabic word from 
Noble’s (4) list. Of the latter, two were para- 
logs. One group learned this list with the 
disyllables as stimulus words; for the other 
group the three-letter common words were the 
stimulus words. Results of the experiment con- 
firmed the prediction from the isolation hy- 
pothesis that the disyllabic paralogs would be 
easier to learn as stimuli than as responses. 

In Exp. III, four groups of 20 Ss learned lists 
of words which were identical except for the 
middle term. The middle term varied in mean- 
ingfulness for the four different groups. This 
manipulation had a marked effect upon the form 
of the serial position curve. The list which con- 
tained a nonsense term in the middle resembled 
that usually obtained in studies of the isolation 
phenomenon, showing a dip in the middle. For 
the other three groups, the serial position curves 
were more nearly “normal,” but in general, the 
more meaningful the middle term, the harder it 
was to learn. 








368 GREGORY A. KIMBLE AND 


An S-R interpretation patterned after Gib- 
son’s (1) theory is offered for the results of these 
experiments. 
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FREQUENCY OF SEEING AND RADIAL LOCALIZATION 
OF SINGLE AND MULTIPLE VISUAL STIMULI! 


H. W. LEIBOWITZ, NANCY A. MYERS, AND D. A. GRANT? 


University of Wisconsin 


In a recent experiment (1) the 
accuracy of radial localization of a 
single visual stimulus was investigated 
as a function of the luminance and the 
duration of exposure. The results 
demonstrated that radial localization 
accuracy of a single stimulus, when 
visible, is constant for all luminance- 
duration combinations. Increase in 
either the luminance or duration of 
expasure, while increasing the fre- 
quency of seeing the stimulus, had no 
concomitant effect upon the accuracy 
of localization. The results were 
interpreted as demonstrating that the 
magnitude of the primary sensory 
events does not limit the accuracy of 
the mechanism of localization, since 
the variables of luminance and dura- 
tion, which have a well-known relation 
to such primary processes (2-5), have 
no effect upon localization accuracy. 
The present experiment extends the 
original study to determine the effect 
upon localization accuracy, under 


-various conditions of luminance and 


duration of exposure, of multiple as 
well as of single stimuli. 


APPARATUS AND PROCEDURE 


The basic instrument employed was a 
modified Dodge tachistoscope? which provides 


1 This research was supported in part by the 
United States Air Force under Contract AF 
33(038)-23294, monitored by the Armament 
Systems Personnel Research Labo.atory, Air 
Force Personnel and Training Research Center, 
Lowry Air Force Base, Denver, Colorado. Re- 
production in whole or in part is permitted for 
any purpose of the United States Government. 

2 Presented at the annual research conference 
on vision, Ohio State University, 1954. The 
authors wish to acknowledge the assistance of 
J. F. Lomont in obtaining the data. 

3 Manufactured by Ralph Gerbrands, Arling- 
ton, Mass. 


for the rapid alternation of adapting and 
stimulus fields for controlled durations of 
exposure. The adapting field was dark with a 
fixation point in the center, and was replaced for 
durations ranging from .01 to .64 sec. with the 
stimulus field consisting of one or more orange, 
circular stimuli, provided by neon glow tubes, 
viewed against a black background. The tubes 
were mounted in holes drilled on a board, and the 
side facing S was covered with onionskin paper. 
At the completion of the desired stimulus ex- 
posure, the adapting field reappeared immedi- 
ately and, .25 sec. later, radial white grid lines, 
each 5.85° long and spaced at 10° intervals, 
appeared in the adapting field for approximately 
1 sec. The Ss observed monocularly through a 
2.5 diameter artificial pupil at a viewing distance 
of 24.5 in. The stimuli, the centers of which are 
always 4.2° from the center of the field, could be 
presented in any desired radial position. Dura- 
tion of exposure was adjusted by means of a 
mechanical timer in the tachistoscope lighting 
circuit, while luminance was controlled by plac- 
ing neutral filters in front of the artificial pupil. 
The maximum luminance of the stimuli, when 
there were no filters in the apparatus, was 
5.12 mL. The Ss, male students who were paid 
for their services, were asked after 10 min. dark 
adaptation to observe a fixation point in the 
center of the field and to report in which of 72 
radial positions the stimulus or stimuli had 
appeared. The stimuli were presented ran- 
domly in all 72 positions at fixed luminance- 
exposure time combinations. The Ss estimated 
the positions of the targets within 5° in terms 
of a system of polar coordinates which had 
previously been explained to them. 

Two experiments were performed. In Exp. I, 
a fixed number of stimuli (i.e., 2, 3, 4, 5, or 6) 
were presented to five groups of six Ss each. 
In Exp. II, six Ss received random presentations 
of stimuli varying in number from one through 
seven. The number of luminance-duration 
combinations employed was determined by the 
time required to test Ss. In Exp. I, for the 2, 3, 
and 4 stimulus groups, luminance values of .02, 
.08, .32, 1.28, and 5.12 mL were used in con- 
junction with durations of .01, .04, .16, and .64 
sec., making a total of 20 luminance-duration 
combinations. For the 5 and 6 stimulus groups, 
luminance values of .32 and 5.12 mL were used, 
together with durations of .01, .08, and .64 sec., 
yielding six combinations. All group members 
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Fic. 1. Frequency of seeing functions and mean radial localization error per reported 
stimulus as a function of luminance at exposure durations of .01, .04, .16, and .64 sec. for pres- 


entations of 2, 3, or 4 stimuli. 


received 50 sets of stimuli under each luminance- 
duration combination. Thus, each member of 
the 2, 3, and 4 stimulus groups received 1,000 
sets of stimuli; those in the 5 and 6 stimulus 
groups saw 300 sets of stimuli. In Exp. II, 
only two combinations were employed, .32 mL 
at .01 sec. and 5.12 mL at .64 sec. exposure. 
Fifty sets of each of the varied numbered stimuli 
(1 through 7) were presented at each of these 
combinations, giving a total of 700 sets of 
stimuli presented to these Ss. 

The experimental design provided for the 
random variation of the position of the stimuli, 
as well as randomization over Ss of the order in 
which the luminance-duration combinations 
were presented. Analysis was made of the 
“frequency of seeing” of the stimuli, the localiza- 
tion error, and possible learning effects. 


RESULTS 
Effects Due to Practice 


No learning effects were observed 
in any phase of the present study. 
With adequate instructions, the S’s 
performance accuracy did not vary 
with practice. 

Frequency of seeing and localization 
accuracy.—In order to demonstrate 


the relationship between frequency of 
seeing and the localization error, both 
sets of data for the 2, 3, and 4 stimulus 
cases are plotted in Fig. 1.4 This 
graph illustrates the lack of corre- 
spondence previously observed for a 
single stimulus (1), between the fre- 
quency with which the stimuli are 
seen, and the radial localization error. 
The previous observation, that local- 
ization accuracy is unaffected by an 
increase in the stimulus energy, is here 
confirmed for 2, 3, and 4 stimuli. 
This principle also applies to the 5 
and 6 stimulus cases which were 
investigated for a more limited range 
of luminance values.‘ 

Localization error as a function of the 
number of stimuli.—It will be observed 
in Fig. 1 that although the localization 
error is constant for a given lumin- 


‘Tables from which Fig. 1 and 2 were con- 
structed have been deposited with the American 
Documentation Institute. Order document No. 
4685 remitting $1.25 for microfilm or $1.25 for 
photocopies, 
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ance-duration combination, the mag- 
nitude of the error increases with the 
number of stimuli. This relation is 
demonstrated in Fig. 2 where the 
average localization error is averaged 
over all luminance-duration com- 
binations and plotted as a function of 
the number of stimuli employed for 
both Exp. I and II. The function is 
similar for Exp. I in which six different 
Ss served in each stimulus-number 
group, and for Exp. II in which the 
same Ss were presented with varying 
numbers of stimuli. (The datum for 
the one stimulus case for Exp. I has 
been taken from the first paper of this 
series [1].) In both cases, localiza- 
tion error increases with increase in the 
number of stimuli, rapidly at first and 
then more slowly, before approaching 
a limiting value. 

Percentage of stimuli reported and 
percentage of stimuli reported correctly. 
—In the present study the Ss are 
asked to estimate the radial position 
of the stimuli under various lumin- 
ance-duration conditions and for vary- 
ing numbers of presented stimuli. At 
the same time, they are reporting the 
number of stimuli seen, a response 
measure which is of interest in con- 


‘nection with the formulation by 


Schlosberg (6) to provide a rationale 
for the observation by Hunter and 
Sigler (5) that the stimulus energy 
required to perceive correctly a given 
number of stimuli increases with the 
number presented. Schlosberg has 
assumed that the perception of a 
single stimulus is an independent 
event, so that given the probability of 
seeing any one stimulus as P, the 
probability of seeing all N stimuli 
which are presented will be P*. This 
assumption of independence would 
predict that the percentage of stimuli 
reported will be independent of the 
number presented, provided the num- 
ber is less than the limit of the “span 


of perception” (5). Table 1, which 
presents the percentage of stimuli 
reported and the percentage of stimuli 
reported correctly may be considered 
in the light of Schlosberg’s assump- 
tions. At the 5.12 mL—.64-sec. com- 
bination, the data are clearly not in 
accord with the hypotheses, although 
the role of eye movements at this 
longer duration may introduce a 
complicating factor. The percentage 
of stimuli reported and the percentage 
of stimuli reported correctly are 
plotted in Fig. 3 for the .32 mL—.01- 
sec. combination. The percentage of 
stimuli reported is in agreement with 
the assumption of independence. 
Schlosberg’s theoretical curve, com- 
puted on the basis of the observed 
probability of seeing one stimulus, 
has been fitted to the data for the 
percentage of stimuli reported cor- 
rectly. The agreement is good for 
the 1, 2, 3, and 4 stimulus cases, but 
the theoretical function overestimates 
the data for 5, 6, and 7 stimuli. 


Discussion 


The results of the present experi- 
ments confirm the previous conclusion 
concerning the independence of the 
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Fic. 2. Mean localization error per reported 
stimulus in degrees of arc as a function of the 
number of stimuli presented. Exp. I: the 
number of stimuli was known and constant for a 
given subject. Exp. II: the number of stimuli 
was variable for all subjects. 
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TABLE 1 


PERCENTAGE OF STIMULI REPORTED AND PERCENTAGE OF STIMULI REPORTED CoRRECTLY 
as A FuncTION oF THE NuMBER oF STIMULI PRESENTED 1N Exp. II at Two 
LuMINANCE-DuRATION COMBINATIONS 








Luminance-duration 


Number of Stimuli Presented 





Combination 
2 





3 





eS SS ee xs. 





Percentage of Stimuli Reported 











.32 mL-.01 sec. 90.3 88.0 92.7 88.2 88.2 85.2 82.1 

5.12 mL-.64 sec. 100 99.3 98.7 94.6 94.8 92.2 90.7 
Percentage of Stimuli Reported Correctly 

.32 mL-.01 sec. 87.7 79.3 76.7 64.3 44.0 34.0 34.7 

5.12 mL-.64 sec. 99.3 98.7 93.7 92.3 79.7 65.7 55.3 


























magnitude of the primary sensory 
events, which are controlled by the 
variables of intensity and time, from 
the mechanism of localization which 
is not affected by experimental manip- 
ulation of these same variables (1). 
This conclusion is seen to be valid 
for multiple as well as single stimuli. 
The data suggest that the localization 








* Percent of stimuli reported 


* Percent of stimuli reported correct/y 
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Fic. 3. Percentage of stimuli reported (0) 
and percentage of stimuli reported correctly (x) 
as a function of the number of stimuli presented. 
The theoretical curve fitted to the number of 
stimuli reported correctly is based on the as- 
sumption that the perception of N stimuli will 
be the n** power of the probability of seeing one 
stimulus. . 


takes place some time after the 
primary sensory events are completed, 
and that if the primary excitation is 
great enough to produce a visible 
stimulus, the localization process fol- 
lows with a degree of accuracy which 
is independent of the magnitude of 
the primary excitation. 

The increase in the mean localiza- 
tion error as a function of the number 
of stimuli presented was found to hold 
both for Exp. I, in which the number 
of stimuli was constant and known to 
Ss, and for Exp. II, in which the 
stimulus number was variable:’ In 
either procedure, accuracy of local- 
ization per visible stimulus decreases 
with an increase in the number of 
stimulus elements presented. 


SUMMARY 


1. Single and multiple stimuli were 
presented on the circumference of an 
imaginary circle for various lumi- 
nance-exposure duration combinations. 
The Ss estimated the radial position 
of the stimulus or stimuli which varied 
in number from one through seven. 

2. Radial localization accuracy is 
independent of the variables of lumi- 
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nance and duration, provided the 
stimulus or stimuli are visible. 

3. An increase in the number of 
stimuli presented increases the radial 
localization error per stimulus. The 
error function increases rapidly at 
first with an increase in the number 
of stimuli, and then more slowly, 
before approaching a limiting value. 

4. The assumption by Schlosberg 
that the perception of a stimulus is an 
independent event was found to hold 
for a .32 mL—.Ol-sec. luminance- 
duration combination but not for a 
5.12 mL—.64-sec. combination. In 
the former case, the assumption that 
the probability of detecting exactly N 
stimuli is the n“ power of the prob- 
ability of seeing one stimulus was in 
good agreement with the data for 
2, 3, and 4, stimuli but overestimated 
the results for the 5, 6, and 7 stimuli. 

5. The mechanism of localization is 
not limited by the magnitude of the 
primary excitatory processes, the 
latter being completed before the 
localization mechanism is operative. 


No explanation is offered for the 
observed increase in the localization 
error per stimulus as a function of the 
number of stimuli presented. 
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DISTANCE JUDGMENT BY THE METHOD 
OF FRACTIONATION ! 


JEAN PURDY AND ELEANOR J. GIBSON 


Cornell University 


The literature of space perception 
abounds in both experiments and 
theoretical discussions of the per- 
ceived size and distance of objects. 
The continuum of distance itself, how- 
ever, has seldom been specifically 
studied as a psychological dimension. 
That distance must be perceived ac- 
curately for objects to remain constant 
in their dimensions is a generally 
accepted proposition. Errors of con- 
stancy, indeed, have been attributed 
to errors in the estimation of distance 
(3; 5, p. 151). It is time to inquire, 
therefore, what kind of psychological 
scale for a dimension of continuing 
distance exists. Can an observer tell 
when far and near stretches of 
distance are equal, or when one is half 
or twice the other? Gibson predicted 
that constancy would hold for the 
distances between objects (2, p. 165) 
on the hypothesis that constancy is a 
property not merely of perceived 
objects but of perceived tridimensional 
space itself. Little evidence is avail- 
able on this question. Gilinsky (3, 
p. 473 ff.) had two Os bisect distances 
from 8 to 200 ft. long, starting from O. 
She predicted that physically equal 
units of distance would appear shorter 
as their distance from O increased. 
This result apparently was found in 
her data, but the small number of O’s 
used make further experiments desir- 


1 This research was supported in part by the 
United States Air Force under Contract No. 
33 (038)-22373 monitored by the Perceptual and 
Motor Skills Laboratory, Human Resources 
Research Center, Lackland Air Force Base. 
Permission is granted for reproduction, transla- 
tion, publication, use, and disposal in whole and 
in part by or for the United States Government. 


able, especially since the results of 
recent size constancy experiments 
indicate that constancy or even “‘over- 
constancy” (4) holds for objects at 
very long distances. It would appear 
reasonable in the light of such a result, 
that “‘constancy,” or better, objective 
accuracy, should characterize the 
judgments of all the physical dimen- 
sions of a surface like the ground. 

In the present experiment the 
nature of a psychological scale for 
distance was investigated by a method 
of fractionation. The O was asked 
to bisect or trisect distances along an 
imaginary line stretching over the 
ground from O to an indicated marker. 
A further group of Os was available 
whose judgments were corrected as 
they were made. These fractionation 
judgments had served as a training 
procedure in another experiment (1). 


METHOD 


The experiment took place in a quadrangle 
350 yd. long. The ground was mowed grass, 
intersected by three walks. Trees and buildings 
lined the sides, but the view to the end of the 
quadrangle was unobstructed. A 300-yd. 
stretch was chosen for O’s scaling operations. 
The O stood at one end of the quadrangle, 
accompanied by £. A bicycle and rider moved 
up or down the quadrangle, as needed, as an 
indicator of the division point which O was asked 
to find. The bicycle rider had a prearranged 
schedule of movements to follow and could be 
signalled by a whistle when O wanted him to 
stop. The area had been surveyed and had 
marks at 25-yd. intervals to guide the rider, but 
these marks were invisible to O. 

Each O made 10 fractionation judgments. 
Six of these were bisections and four were 
trisections. A “bisection” is the act of stopping 
the bicycle at half the distance covered by the 
stretch; a “trisection” is the act of stopping the 
bicycle at one-third the distance covered, the 
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TABLE 1 
PLAN OF FRACTIONATION JUDGMENTS 
Bicycle Approaching Bicycle Withdrawing 
mgt 
0. 0 
Judgment Stretches Stretches Stretches Stretches 
Bisected (yd.) Trisected (yd.) Bisected (yd.) Trisected (yd.) 
1 300 (300 to 0) 300 (0 to 300) 
2 150 (150 to 0) 150 (150 to 300) 
3 150 (300 to 150) 150 (0 to 150) 
4 75 (300 to 225) 75 (0 to 75) 
5 75 (225 to 150) 50 (25 to 75) 
6 50 (200 to 150) 75 (75 to 150) 
7 75 (150 to 75) 50 (100 to 150) 
8 50 (125 to 75) 75 (150 to 225) 
9 75 (75 to 0) 50 (175 to 225) 
10 50 (50 to 0) 75 (225 to 300) 

















judgment in the latter case being the first third 
or the nearest third of the stretch covered. 
Table 1 shows the actual stretches divided, and 
the order in which the fractionations were made. 
The plan was such that the true division points 
for the 10 judgments made up a series at 25-yd. 
intervals from 25 through 250 yd. as indicated 
in the left-hand column of Table 2. The Os 
were divided into two equal subgroups depending 
on whether the bicycle rode away from O or 
toward him. The order of judging, and whether 
O halved or trisected, varied between the two 
subgroups so as to keep within bounds the 
mileage required of the bicycle rider. The first 
three bisections, however, were comparable for 
withdrawal and approach trials. 

Before O was asked to make a given judg- 
ment, markers were set up to indicate the two 


_ ends of the stretch that he was to divide. The 


markers were plain white rectangles, unlabeled. 
In some trials, O’s own station point was the 
origin of the distance stretch indicated, so that 
only one marker was required. 

The O was brought to his station point, given 
his instructions, and asked to turn his back 
while the markers were set up. When these 
were in place and the bicycle at the starting 
point of the stretch to be divided, E asked O 
to turn and judge when the moving target had 
reached the division point of the specified 
fraction. The E£ signalled the rider when O said 
“Stop.” If O was not satisfied with his first 
setting, one further adjustment was permitted. 
He then faced the other way during the interval 
between judgments. The error was measured 
by the bicycle rider with the aid of a fixed 
reference point that was invisible to 0. There 
were 40 Os, half with the bicycle approaching 
and half with the bicycle withdrawing. 

The separate group of Os who were corrected 
made the same judgments, except that O was 2 


permitted to see the direction and extent of his 
error, and permanent yard markers labeled with 
the number of yards distant were set up at the 
appropriate division points as the judgments 
were completed. There were 67 Os in this 
group, of whom 53 watched the bicycle approach 
and 14 watched it withdraw. All Os were Air- 
men in basic training at Sampson Air Force Base. 


RESULTS 


The data obtained from the bi- 
sections and trisections are presented 
in Table 2. Group” mean constant 
errors (CE) in yards and SD’s are 
given for each of the 10 true division 
points of the stretches divided. A 
positive error means that O divided 
the stretch so that the segment nearer 
him was too large; a negative error 
indicates that O divided the stretch 
so that the segment nearer him was 
too small. (It may be noted that if 
the subject divided the stretch into 
optical extents of equal visual angle, 
the nearer segment, by optical geom- 
etry, would be much too small.) The 
length of the interval fractionated 
and its terminal points can be found 
by consulting Table 1. 

Accuracy of fractionation.—The first 
question concerns the relation of the 
judged half-way or third-of-the-way 
point to the actual division point. 
Figure 1 presents these data for the 
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TABLE 2 
Group Mean CE anp SD or Fractionation JUDGMENTS 
Without Correction With Correction 
Distance Target Approaching Target Withdrawing Target Approaching Target Withdrawing 
of True 
Point, Vd 
J ° Tr 

Nov of aes | sp Nov at aes | sp Nov af =" | sp Nov at | sp 
Judg yd. yd. | Judg-| vq | yd. | Jude-| yy yd. | Jude- |} ya. yd. 

ment ment ment ment 
25 10 —1.2*} 2.0 4 | —2.8*| 2.3 10 +2.3*| 3.7 a +4.1*| 3.6 
50 9 —1.1 3.7 5 | +0.7 2.6 9 —0.5 4.6 5 —1.7 3.3 
75 2 +5.9*} 11.1 3 +0.4 7 2 +2.1 8.7 3 +0.1 3.1 
100 8 +0.5 3.8 6 | +1.6 3.9 8 +1.5*| 3.4 6 +3.1*| 4.1 
125 7 —0.3 3.7 7 | +1.8*| 3.4 7 —1.1 4.8 7 +5.7*| 7.8 
150 1 |+21.6*| 19.0 1 —2.8 3.9 1 |+17.5*| 19.4 1 +7.6 | 16.9 
175 6 —2.0*| 4.3 8 | —0.4 6.1 6 —0.1 4.6 8 +2.4*| 2.7 
200 5 —1.0 8.1 9 —2.9* | 4.5 5 —7.4*| 11.8 9 |—12.4*| 6.4 
225 3 +6.6* | 12.6 2 —5.8 6.2 3 +6.4* | 15.1 2 |-—12.4*| 17.0 
250 + +1.6 9.8 10 | —4.7*| 7.4 + +6.2* | 10.4 10 —0.9 | 11.9 









































* CE significantly different from 0 (p < .05). 


main group of Os who were not given 
knowledge of their errors. The most 
striking feature of these results is the 
small amount of error for many of the 
judgments. Only for one act of 
fractionation out of 20 does the mean 
CE go above 63 yd. When OQ indi- 
cates one-half or one-third of a 
distance stretch, the fraction corre- 


sponds very well with the measured 
half or third. Nine of the 20 CE’s are 
significantly different from zero (see 
Table 2, uncorrected group), but even 
the largest is only 7.2% of the total 
distance fractionated. On the aver- 
age, the CE is 3.1% of the total 
distance fractionated. There is a 
tendency for the CE to be large when 
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DISTANCE JUDGMENT 


large distance stretches are fraction- 
ated. The average error (without 
regard to sign) is directly related to 
the total distance fractionated, in- 
creasing as the length of the stretch 
to be fractionated increases. It 
ranges from 3.2 for a 50-yd. bisection to 
21.2 for the 300-yd. bisection. 

The size and direction of the error 
appear to be affected by the direction 
of movement of the “target.”” When 
the two uncorrected subgroups are 
compared in the first three judgments, 
the means differ significantly at 150 
and 225 yd. The means for approach 
and withdrawal also differ on frac- 
tionations represented by points at 
250 yd. and 25 yd. However, these 
last two differences are not necessarily 
due to the direction of movement of 
the target alone. Other differences 
confounded with direction of move- 
ment for all judgments after the third 
were: (a) the judgments were made in 
different orders, (b) Os were not al- 
ways making the same kinds of 
fractionation of the same distance 
stretch (see Table 1). Thus the 
difference between approach and with- 
drawal CE’s for fractionations at 25 
_ and 250 yd. could have been due to 
any of these variations in procedure. 
In general, the error appears to be 
positive on approach trials but much 
less positive or negative on with- 
drawal trials. At the 150-yd. division 
point, for instance, 88% of the ap- 
proach group but only 53% of the 
withdrawal group made _ positive 
errors. 

No difference in the variability of 
fractionation was produced by the 
direction of target movement and the 
kind of accompanying fractionation 
except at 200 yd. where the SD for the 
approach condition was higher. 

A comparison of the accuracy of 
bisections and trisections can be made 
by comparing the mean CE’s of 
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halving 50-yd. stretches with the mean 
CE’s of taking one-third of 75-yd. 
stretches. By such a comparison, the 
size of the CE is no larger for division 
into thirds than for division into 
halves, though group variability is 
larger for the one-third judgments. 

Variability was also influenced by 
the distance from O of the stretch 
to be fractionated. Trisections of 75 
yd. can be compared for four stretches 
which begin at 0, 75, 150, and 225 yds. 
from O. As the distance of the 
stretch from O increases, the SD also 
increases. 

Indications of distance “‘constancy.” 
—It might be supposed that per- 
ceptual judgments of distance mag- 
nitudes are based wholly or in part on 
the visual angles subtended at the 
retina by the stretches of distance. If 
two physically equal stretches, one 
near and one far, were compared on 
this basis, the resulting relative sub- 
jective magnitudes would not be 
equal; the farther stretch would sub- 
tend a smaller visual angle and would 
appear less. In order to make it 
appear equal, the further stretch 
would have to be actually greater (a 
negative error in our experiment). 
Thus if O made fractionation judg- 
ments wholly or partly on the basis of 
such information, a negative CE 
would be the result. Gilinsky (3) 
may have assumed something like 
this in stating that “Perceived dis- 
tances are foreshortened. The per- 
ceived distance d increases with the 
true distance D but at a reduced and 
diminishing rate” (p. 462). The pres- 
ent data, however, do not substantiate 
such a hypothesis. The errors in 
general are positive—that is, O made 
the nearer segment too large in com- 
parison with the farther. And the 
high degree of accuracy in general 
indicates that distance judgment ex- 
hibits a kind of constancy analogous 
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to size judgments. The term /ength 
or depth constancy, as distinguished 
from width or height constancy, might 
be applied to O’s judgments. The 
fact is that a distance stretch be- 
ginning at his own feet can be matched 
fairly accurately with another stretch 
beginning at a distance from his feet 
along the same axis. 

In Fig. 2 the mean distance judged 
one-half has been plotted for the three 
distance stretches which began at O’s 
feet: 50, 150, and 300 yd. Judgments 
for apprvach and withdrawal trials are 
presented separately, as well as the 
means for the two combined. The 
45° line describes correct half judg- 
ments. Though points for the ap- 
proach condition diverge from accu- 
racy, the mean values lie extremely 
close to that line. There is a slight 
indication that as the distance to be 
halved increases, O’s subjective half- 
way point may move farther away 
from him so that the nearer segment 
is physically larger than the farther 
segment. 
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Fic. 2. The relationship between the point 
judged half-way between O and a marker, and 
the distance of the marker in yd. The line at 
45° on the graph represents accurate frac- 
tionation. : 


Figure 3 shows the error that 
resulted from the fractionation of 
distance stretches as the distance of 
those stretches from O increased. The 
points connected by solid lines rep- 
resent bisections of 50-yd. stretches. 
The error for each bisection is plotted 
at the true half-way point of the 
stretch bisected. For instance, the 
point at 125 yd. represents a bisection 
of a stretch extending from 100 yd. to 
150 yd. The dotted lines connect 
error values that resulted from divid- 
ing a 75-yd. stretch into intervals 
subjectively equal to one-third and 
two-thirds. 

The horizontal line drawn at zero 
error indicates the expected judgments 
if “constancy” for distance stretches 
were perfect. The dashed curve, on 
the other hand, describes the judg- 
ments expected from a hypothetical O 
(6 ft. tall) who fractionated the 
stretches by making the two fractions 
subtend (a) equal visual angles in the 
case of halving 50 yd., or (b) visual 
angles equal to one-third and two- 
thirds of the total angle subtended by 
75 yd. The obtained judgments are 
close to the line of objective accuracy 
and do not follow the hypothetical 
visual angle judgment curve. The 
judgments at the far stretches show 
some tendency to be negative. Since 
the error is small and since the two 
hypothetical curves are close together 
at the far stretches, it is not possible 
to conclude that the plotted points 
follow either one or the other curve at 
175, 200, and 250 yd. The points 
plotted at 25, 50, 100, and 125 yd. 
follow the “constancy” line. 

This demonstration suggests the 
interdependence of all spatial con- 
stancies. All the dimensions of a 
surface, both frontal dimensions 
(termed size) and the third dimension 
(termed depth or distance) show con- 
stancy of magnitude. Gibson’s more 











¢ 
Cc 
> 
2 
- 
F 
Fe 
C 
< 
c 
u 
L 
c 
\ 
( 
t 
: 


— nr ee oon 





of 
yf 


feOr 0 @ Que 


— Dm 1 


> 


bl |) ee | i ee | ee) 








DISTANCE JUDGMENT 379 























+5 
° thiciaaas 
2 .¢) 
. eee — 
> yt ad -0 
ro] oOo = 
5 ‘. 
3 -st eon 
= - 
2 -" 
a a 
we 7 
u -l0Fr Pe 
° / 
W / 
z Ps 
a -I5 F / 
= / 
’ 
/ UNCORRECTED GROUP 
/ 
-20F / @ TARGET APPROACHING S 
/ 
' © TARGET WITHDRAWING FROM S 
! 
’ we (72 ceeeees 143 JUDGMENT 
-25 i rn l l 1 1 l 1 L L 
1°) 25 50 75 100 125 150 175 200 225 250 


TRUE POINT OF DIVISION, YDS. 
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stretch into one-third and two-thirds (dotted line) as a function of the true division point in yd. 
The dashed curve represents the expected error if O were to divide the stretches so that the resulting 
fractions subtended appropriate fractions of visual angie. 


general statement may thus be ap- 
plicable—that “Scale, not size, is 
actually what remains constant in 
perception” (2, p. 181). 

The effect of correction on fractiona- 
tion.—So far, only the results of un- 
corrected Os have been considered. 
Another large group of Os whose 
errors were corrected as the judgments 
proceeded can be compared with 
them. Table 2 gives mean CE’s and 
SD’s for both groups. The corrected 
group had 12 of 20 CE’s significantly 
greater than zero, while only 9 of the 
uncorrected group’s were significantly 
greater. A ¢t test comparing the 
magnitude of the CE’s of the corrected 
and uncorrected groups shows that 
five of the CE’s are significantly larger 
(p < .05) in the corrected group (at 
125 and 200 yd. with bicycle with- 


drawing, and at 25, 200, and 250 yd. 
with bicycle approaching). The 
mean CE is in no case significantly 
larger in the uncorrected group. Cor- 
rection, therefore, does not appear to 
increase the accuracy of fractionation, 
which is, indeed, very good already. 
The fractionation with correction did, 
however, serve as an effective training 
procedure for later absolute estimates 
of distance (1). 


SUMMARY 


The present study is an application of a psy- 
chological scaling operation (fractionation) to 
perceived distance. A 300-yd. stretch of grass- 
covered flat ground was used for O’s judgments. 
A bicycle moved up or down this stretch, and O 
indicated his judgment by stopping the rider at 
what he considered the correct division point. 
Six bisections and four trisections were made, 
with different intervals and ground stretches so 
chosen that one of the true division points would 
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lie every 25 yd. from zero (O's station point) 
through the 250-yd. point. 
The results suggest the following conclusions: 


1. Observers can divide stretches of distance 
(up to 300 yd.) into halves or thirds with very 
good accuracy. Perceived magnitudes of dis- 
tance appear to correspond well with physical 
magnitudes of distance. 

2. There is no tendency for the judgments to 
follow a hypothetical curve which would result 
from their being based on the magnitude of the 
visual angle subtended by different distance 
stretches. 

3. Error is related to the direction of motion 
of the target; the CE tends to be positive as the 
bicycle approaches, and either less positive or 
negative when the bicycle withdraws. 

4. Fractionation of distances was not im- 
proved by correcting O’s errors, nor was varia- 
bility reduced. 
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EFFECTS OF COMPLEX TRANSFORMATIONS 
OF FEEDBACK UPON SIMPLE 
INSTRUMENTAL BEHAVIOR! 


CLYDE E. NOBLE AND IRVIN G. BROUSSARD 


Louisiana State University 


Research in human instrumental 
behavior has shown that knowledge of 
results or psychological feedback (3) 
is a major factor in the establishment 
and maintenance of response strength 
as well as proficiency or skill. Fol- 
lowing early qualitative work by 
Judd (5), Arps (1), and Thorndike 
(6), later investigators became in- 
terested in quantitative aspects of the 
cue, drive, and reward properties of 
the law of effect. In line drawing, 
for example, Trowbridge and Cason 
(7) demonstrated the dependence of 
skill upon specificity of feedback. 
When the direction and magnitude of 
error was reported, Ss’ responses were 
more accurate and goal scores were 
more probable than when E merely 
said “right” or “‘wrong.” 

Recently, Bilodeau (2) studied the 
influence of transformations of feed- 
back on the behavior of turning a 
_ micrometer to obtain a certain score. 
Linear equations of the form Y 
=aX +5 were employed to convert 
Ss’ true scores (X) into reported 
scores (Y). The arbitrary goal or 
incentive was a reported score of 
200, defined as a scale reading of 
.2 in. multiplied by a constant of 
1000. By appropriate choice of slope 
and intercept parameters in his design, 
Bilodeau caused all lines to intersect 
at the point Y = X = 200. Each 
S’s actual deviations from the target 
(X-200) were calculated and the mean 
performance plotted over 16 succes- 
sive practice trials. The results in- 

1 This paper is based on a master’s thesis in- 


vestigation by the junior author under the 
guidance of the senior author. 


dicated: (a) a family of stable error 
curves which approached a common 
asymptote as a limit, (b) a rate of 
approach to mastery which varied 
positively with the magnitude of the 
slope and negatively with the inter- 
cept, and (c) response variability 
which increased initially and then 
decreased as practice continued. 

A technical problem discussed by 
Bilodeau is the fact that his conditions 
entailed joint, inverse variation of the 
slope and intercept parameters. If 
either a or b had been held constant 
while the other varied, a fixed in- 
centive would have been logically im- 
possible and the tasks psychologically 
incomparable. Bilodeau concluded, 
therefore, that proficiency is positively 
related to the magnitude of the re- 
ported target deviations (Y-200). 
The present experiment represents an 
extension of Bilodeau’s study in that 
micrometer performance will be in- 
vestigated under nonlinear conditions 
of feedback. This is intended to ac- 
complish two primary objectives: (a) 
the resolution of the problem of 
multiple parameters, and (b) the 
systematic comparison of linear and 
curvilinear transformations. 


With respect to the first objective, we 
have chosen a parabolic equation of the 
general form Y = aX? for all experi- 
mental conditions. The mathematical 
properties of this function are such that 
the slope parameter (4) may be varied 
while holding constant the origin (log a) 
and goal intercept (unity). Three treat- 


ments, differing quantitatively in the 
manner in which X scores are converted 
to Y scores, are sufficient for a repre- 
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Fic. 1. Experimental design for studying 
the effects of curvilinear transformations of 


feedback. 


sentative analysis. The transformation 
equations are as follows: 


Group Variable Equation 

Minus Negative acceleration Y= X35 (1) 
Zero Constant velocity Y = X!© (2) 
Plus Positive acceleration Y= X‘® (3) 


Figure 1 shows the experimental 
design graphically. It will be observed 
that Group Zero is linear and equivalent 
to Y = X. This condition serves as a 
point of reference for evaluating the 
other two treatments. It represents the 
usual case of psychological feedback; 
i.e., true information. The reported 
scores of Group Minus are a negatively 
accelerated increasing function of their 
actual scores, whereas the reported 
scores of Group Plus are a positively 
accelerated increasing function of their 
actual scores. Note that the slopes of 
all three equations increase in the order 
shown and that the 4 parameters of 
Equations 1 and 3 are simple reciprocals 
of each other. 

With respect to the second objective, 
the present study will permit compari- 
sons between ordinary feedback and two 
symmetrical complex transformations. 
Equation 2 provides both a control 
group and a link with the Y = X condi- 
tion of the Bilodeau experiment, whereas 
Equation 1 should elicit behavior similar 
to that of his Y = .500X + 100 group.? 


2In order to facilitate comparisons with 
Bilodeau’s design, Fig. 1 has been constructed 


There is no linear parallel in Bilodeau’s 
study for our Equation 3, essentially 
because he did not employ a negative 
Y intercept. (An analogous equation 
might have been Y = 2.0X — 200, but 
this would have entailed large, negative 
reported scores for the early trials.) 
Considering the differences in initial 
reported scores and subsequent feedback 
rates of the three groups in Fig. 1, 
it may be expected that Group Plus will 
at first overshoot the target and then 
improve in accuracy by discrete cor- 
rections. Generalizing from 'Bilodeau’s 
results, Groups Zero and Minus should 
achieve proficiency more smoothly, with 
Group Zero definitely superior. 


MeETHOD 


Subjects—The Ss consisted of 96 male and 
female undergraduate students at Louisiana 
State University, assigned independently and 
without bias to the three treatment groups 
(n = 32). The male/female ratio for each 
condition was as follows: Minus = 22/10, 
Zero = 21/11, Plus = 18/14. None had previ- 
ous experience with the apparatus and all were 
tested singly. The Ss’ ages ranged from 18 to 
31 yr. with a mean of 20.7 yr. There were no 
eliminations. 

Apparatus and procedure—A 3-4-in. Sears 
Craftsman micrometer was mounted in a ply- 
wood screen in the manner described by Bilodeau 
(2). The S’s problem was to learn to turn the 
exposed thimble by an amount necessary to 
receive a numerical score of 200. The accuracy 
of S’s response on a given trial, therefore, was 
contingent upon £’s report of his performance on 
the preceding trials. The instructions ex- 
plained that 200 constituted the goal score, 
that size of score increased with the number of 
turns, that each trial would start from zero, 
and that a uniform type of response was required 
throughout practice. To remove conflicting 
visual cues and to simplify E’s recording task, 
our Ss were blindfolded with occlusion goggles 
and were required to use the left hand to 
manipulate the thimble. (In Bilodeau’s pro- 
cedure Ss were not blindfolded and used the 
right hand.) The vernier scale was read by E 
after each response or series of responses (trial), 
then reset to zero for the succeeding trial. 


on the basis of a target score of 200 rather than 
unity. This was achieved by the following 
simple modifications of Equations 1 and 3 in 
order to refer all curves to a common scale: 


Y = 200 (X/200)*5 (la) 
Y = 200 (X/200)** (3a) 
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COMPLEX TRANSFORMATIONS OF FEEDBACK 


Approximately 5 sec. elapsed between S’s 
response and £’s report, followed by 10 sec. 
before the next trial. All Ss were given 20 
practice trials to achieve the incentive. 

Another procedural departure from the 
Bilodeau study was the restriction of S’s first 
response to a half turn, rather than a full turn. 
A response of exactly 180° equals a true score of 
12.5 (in units of .001 in.). Thus, for all treat- 
ments 16 half turns were necessary and sufficient 
to achieve a goal score of 200. Of course, Ss 
were not informed of this fact and care was 
taken to isolate them from the remaining pool 
of Ss upon completion of the experiment. As 
can be seen in Fig. 1, this method of*standard- 
izing Trial 1 was required in order to provide 
ample opportunity for differential acquisition 
rates. Given an actual score of 16, the reported 
scores for the various groups would be as follows: 
Minus = 113, Zero = 20, Plus = .02. The so- 
called “practice trial” also served effectively to 
match Ss as to initial means and variances, 
since a single wrist motion constitutes a well- 
standardized response unit. The E£ recorded the 
X scores in terms of .001 in., then rounded to the 
nearest 10 before entering the appropriate 
transformation table. By this device all groups 
had 20 available Y scores between 0 and 200. 
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REsULTs AND Discussion 


Acquisition phenomena.—In order 
to observe the directionality of errors, 
it is necessary to plot the data in 
terms of algebraic deviations (X-200) 
from the goal score. Bilodeau cal- 
culated mean absolute deviations, but 
this was because his Ss made few 
responses greater than 200. Further- 
more, since skewness was an expected 
characteristic of the frequency dis- 
tributions, especially those of Group 
Plus, medians were computed instead 
of means. 

Figure 2 presents the error curves 
as a function of successive trials for 
the three transformation groups. The 
initial X scores are almost exactly 
equal. The means and SD’s (in 
parentheses) for the respective groups 


on Trial 1 are as follows: Minus = 17 
(3.0), Zero= 16 (3.8), Plus = 16 
(3.5). A simple analysis of variance 
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for 2 and 93 df was performed upon 
these data, resulting in an F ratio less 
than unity. The over-all mean of 
16.33 represents an overestimation of 
one half turn by 30.6%. 

Inspection of the performance char- 
acteristics shows that the error curve 
for Group Minus decreases at a much 
slower rate than that of Group Zero, 
which is negatively accelerated 
throughout. As Bilodeau found for 
his comparable groups, “transforma- 
tion” by Equation 2 is superior to that 
by Equation 1. However, the rate of 
acquisition for the Y = X condition 
in the present experiment is much 
faster than in Bilodeau’s. Among 
possible differentiating factors, the 
most likely one is that our Ss were 
required to make a 180° turn on 
Trial 1, whereas Bilodeau’s Ss were 
instructed initially to turn the thimble 
360°. This phenomenon may be at- 
tributed to the motivational effect of 
S’s reported distance from the goal 
following his first response.’ 

From Fig. 2 it is evident that the 
curve for Group Minus has an early 
positive acceleration followed by later 
negative acceleration. On the whole, 
the curve has a definite sigmoid form 
and closely resembles Bilodeau’s 
Y = .500X + 100 condition. The 
performance of Group Plus is con- 
sistent with expectation in that Trial 2 
reveals a large overshooting error 
(+300.5) followed by a small under- 
shooting error (— 52.8) on Trial 3.4 
There is a subsequent trend toward 


3Other procedural differences between the 
two experiments (such as rig'it vs. left hand, 
vision vs. nonvision, and military personnel vs. 
college students) are probably of little impor- 
tance in the light of available <vidence. A 
preliminary micrometer study in the L.S.U. 
laboratory employing the Y = .500X + 100 
transformation, a 360° Trial 1 response, the left 
hand, blindfolding, and college level Ss gave 
data which agreed very closely with those of 
Bilodeau. ; 

4A problem arose in this connection due to 
physical limitations of the apparatus. Since a 
3—4-in, micrometer permits only a 1-in, excursion 


C. E. NOBLE AND I. G. BROUSSARD 


diminishing oscillation, but the al- 
ternation is irregular and slightly 
asymmetrical in favor of under- 
shooting. 

The accuracy of Group Zero is 
superior throughout. From Trial 4 
onward, the mean median deviation 
scores for the respective groups are: 
Zero = — 1.45, Plus = — 2.16, Minus 
=-—18.55. Descriptively and by 
analogy to military gunnery methods, 
the behavior of Group Zero can be 
called a “iaying-on” technique, while 
that of Group Minus is characterized 
by “creeping” toward the target. 
Distinctively different from either of 
these is the “bracketing” procedure of 
Group Plus, where>y successive ob- 
servations of “strikes” permit the 
eventual “zeroing-in” of the target. 
More generally, the performances of 
Groups Zero and Minus may be 
termed a continuous approach, 
whereas that of Group Plus represents 
a discontinuous approach. 

Performance variability —A further 
comparison of our data with Bilo- 
deau’s can be made in terms of the 
SD’s of the X scores. In Fig. 3, 
SD’s of the true scores are plotted as 
functions of trials for the three trans- 
formation groups. The general trend 
is an increase in variability on Trial 2 
followed by a fairly uniform decrease 
to the limit of training. The mag- 
nitude and form of the variability 
function is different for each experi- 
mental treatment. Group Plus ex- 
hibits maximum variability on Trial 
2 which is almost a hundred times its 
variability on Trial 1. Although the 
SD’s drop rapidly in the following 
four trials, Group Plus is more vari- 





of the spindle from the zero setting, the maxi- 
mum possible X score is 1000 (1 in.). About 
31% of the Ss in Group Plus on Trial 2 turned the 
spindle to the limit, giving rise to indeterminate 
mean scores. Exactly 12.5% of Trial 3 re- 
sponses and about 3% of Trial 4 responses were 
of this class. After Trial 4 the problem did not 
exist, but it was necessary in the foregoing cases 
to record an arbitrary X score of 1000, 
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able than the others for the last half 
of the experiment. Groups Zero and 
Minus are comparable initially, their 
Trial 2 SD’s being about 30 times 
that of Trial 1. Group Zero’s vari- 
ability decreases with practice, but 
Group Minus undergoes a rise at 
Trial 4 which peaks two trials later 
and then gradually falls. All curves 
appear to be merging toward the 
same asymptote. It is notable that 
the frequency distributions become 
more symmetrical as the terminal 
trials are approached. 


In his analysis of the effects of linear 
score transformations upon micrometer 
performance, Bilodeau hypothesized that 
Ss respond initially as if they expected 
the functional relationship between re- 
ported and true scores to be Y = X, and 


that they subsequently extinguish this 
pre-experimental assumption “‘at a rate 
which varies with the magnitude of the 
transformation constants” (2, p. 420). 
To evaluate this, he tested the mean 
differences between each S’s aspired and 
achieved X scores on Trial 2 for sig- 
nificance. Of the five independent com- 
parisons, only two ¢ ratios were small 
enough to permit retention of a null 
hypothesis (p > .05). Bilodeau’s con- 
tention was not even supported for his 


Y=X group; with a p<.01, Ss 
receiving no transformation deviated 


maximally from prediction. 


We applied 


the same test to our data, calculating 
individual Trial 2 expectancies from the 
response and feedback on Trial 1. The 
mean difference for Group Plus was in- 
determinate due to apparatus restrictions 
mentioned above, but the obtained scores 
of Groups Zero and Minus were entirely 
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consistent with Bilodeau’s hypothesis. 
For df = 31, the p values for the Zero 
and Minus comparisons were > .60 and 
> .10, respectively. 

Even if one concludes, however, that 
the results of these two experiments are 
due to differential rates of extinction 
of an initial equivalence set, there still 
remains the problem of explaining the 
manner in which the extinction mech- 
anism operates. Presumably, Bilodeau 
would favor a _ counterconditioning 
theory. Our own preference is to em- 
phasize motivational rather than as- 
sociative factors in the micrometer 
situation. Assume that the magnitude 
and direction of the reported target error 
(Y-200) following a given trial con- 
stitutes an aversive state of affairs. By 
the principle of secondary reinforcement 
(4), any X score causing an apparent 
reduction of this discrepancy and better 
orientation toward the goal should tend 
to be repeated. Conversely, X scores 
that are wide of the mark should tend to 
be avoided. For this reason, the mathe- 
matical form by which knowledge of 
results is transmitted to S will largely 
determine his average rate of gain in 
proficiency over repeated trials. This 
interpretation of the role of feedback 
transformations regards the learning 
factor as a constant with incentive- 
motivation as the variable. This is 
consistent with special effects like the 
relatively slow acquisition of Bilodeau’s 
Y = X group, the initial overshooting 
and high variability of our Y = X*° 
group, and the superiority of Y = X!-° 
over Y = X°?5, 

The differences found in the present 
investigation can be attributed to a 
single experimental factor, but it would 
be erroneous to conclude that proficiency 
is a direct function of the size of the 4 
parameter. Differential acceleration 
components in psychological feedback 
affect human instrumental behavior in a 
complex manner. Our findings suggest 
that the informative or cue properties of 
knowledge of results should not be 
stressed at the expense of its drive and 
reward properties. The micrometer 
offers excellent opportunities for study- 
ing these problems, in addition to such 
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allied topics as success and failure, ego 
involvement, and level of aspiration. 


SUMMARY 


This paper has described the effects of 
curvilinear transformations of feedback upon 
the behavior of turning a micrometer while 
blindfolded to achieve a certain numerical score. 
Parabolic equations of the general form Y = aX?® 
were employed to convert each S’s true scores 
(X) into reported scores (Y). This procedure 
entailed manipulation of the factor of accelera- 
tion (positive vs. zero vs. negative), while 
holding constant both origin and goal intercept. 
Ninety-six Ss, divided equally into three experi- 
mental conditions, were differentiated as Group 
Minus (Y = X75), Group Zero (Y = X!) and 
Group Plus (Y = X**). All Ss were given 20 
practice trials to attain the goal score. 

In general, Group Zero (true information) 
showed superior accuracy throughout, followed 
by Group Plus and Group Minus in that order. 
The performances of Groups Zero and Minus 
were characterized by a continuous approach to 
the target, while Group Plus exhibited a dis- 
continuous approach. 

A tentative explanation of the role of feed- 
back transformations in instrumental behavior 
was presented. 
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DISCRIMINATION DECREMENT AS A FUNCTION 
OF TIME IN A PROLONGED VIGIL! 


PAUL BAKAN? 


New York University 


This experiment was designed to 
investigate the change in differential 
brightness threshold as a function of 
time, in a situation where S is required 
to respond overtly to irregularly 
presented variable stimuli differing in 
brightness from a regularly recurring 
standard stimulus. 

Mackworth (3, 4) found a decrease 
in the number of successful responses 
as a function of time spent at the 
Clock Test, a task requiring con- 
tinuous vigilance. In the Clock Test, 
S is required to respond overtly to 
irregularly and infrequently presented 
critical stimuli, i.e., double-length 
movements of a pointer in front of a 
flat white surface. The pointer jumps 
regularly once a second, making a 
single-length movement, with an oc- 
casional double-length movement oc- 
curring in place of a single-length 


1 This paper represents part of a dissertation 
submitted in 1953 to the Graduate School of 
New York University in partial fulfillment of the 
requirements for the Ph.D. degree. The writer 
wishes to acknowledge his indebtedness to 
Lawrence Karlin of New York University, who 
supervised the thesis. 

The research was done at the Training Re- 
search Laboratory of the University of Illinois. 
It was supported in part by the United States 
Air Force under contract No. AF 33(038)-25726, 
monitored by the Air Force Personnel and 
Training Research Center. Permission is 
granted for reproduction, translation, publica- 
tion, use, and disposal in whole and in part by 
or for the United States Government. A report 
of the contents of this paper was given at the 
1953 meeting of the Midwestern Psychological 
Association. 

The writer wishes to express appreciation to 
William E. Kappauf and Lyle H. Lanier of the 
University of Illinois for their invaluable aid 
during the course of this study. 

2 Now at Michigan State University. 


movement. The S is not required to 
make any overt response to single- 
length stimuli. The decrease with 
time in the number of responses to 
critical stimuli in such a situation 
may be interpreted in terms of an 
increase in threshold for the required 
discrimination in the course of time. 
In order for S to respond overtly to 
the critical double-length stimulus, he 
must discriminate between the single- 
length stimulus and the double- 
length stimulus. If the decrement in 
frequency of response is thought of in 
terms of an increase in threshold, then 
it should be possible to obtain a 
measure of decrement by taking re- 
peated threshold measures during a 
prolonged discrimination task. In 
this experiment the method of limits 
is used to measure thresholds over a 
period of time in a continuous bright- 
ness discrimination task, to see 
whether a decrement can be measured 
in terms of an increase in threshold. 


MeEtTHOoD 


Subjects.—Twenty paid Ss (10 male and 10 
female), ranging in age from 16 to 35 years, 
participated in this experiment. Each S was 
tested on two different days, generally during the 
same week. 

Test situation.—The S viewed a test area, 15 
min. of arc in diameter and 9 ft. away, located 
within a uniformly illuminated surround (2.6 
ft.-L.). The test area flashed regularly once a 
second, holding at a base brightness (2.1 ft.-l.) 
for 4 sec. and flashing to a standard brightness 
(3.1 ft.-L.) for the remaining } sec. During a 
trial the test area flashed to a level brighter than 
standard and S was instructed to push a button 
whenever he detected a stimulus brighter than 
standard. 

In order to study the course of the threshold 
change during the vigil, the brightness increment 
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required to evoke a response was determined by 
the ascending method of limits on each trial. 
When a trial was called for, a test stimulus of 
Brightness Level 1, the smallest brightness 
increment over the standard, was presented in 
place of the standard. If no response occurred, 
Brightness Level 2 was presented 5 sec. later, 
i.e., five flashes later, and so on with increasingly 
brighter test stimuli at 5-sec. intervals until S 
indicated seeing a brighter stimulus by pressing 
the button. 

The test stimuli ranged from 3.2 to 4.0 ft.-L. 
in a series of nine equal log steps. Trials were 
begun by £ at times which had been scheduled 
in advance. Test sessions were 1.5 hr. long and 
the distribution of trials in time was such that 
there were six equivalent 15-min. periods. Six 
trials were scheduled for each 15-min. period. 
The trials were irregularly spaced and came 
after waits of 50, 60, 65, 140, 185, and 350 sec. 
between the beginning of one trial and the 
beginning of the next. Four different 15-min. 
programs of waits were used. These programs 
were chosen at random from the possible 
arrangements of the six wait periods. The 
varied programs were used so that no two 
successive 15-min. periods would have the same 
wait sequence, and so that the wait sequences 
on the second day would be different from those 
on the first day for any individual S. 

Practice.—After the instructions to S, various 
test stimuli ranging down to S’s threshold were 
demonstrated to S. This was followed by a 
7.5-min. practice period, during which five trials 
were given under test conditions like those of 
the experimenta! session. A 5-min. rest period 
followed this practice period. 

Scoring.—Two types of score were obtained 
from the data collected during the one and one- 
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Fic. 1. The effect of time on the difference 
limen for each day. 


TABLE 1 


ANALYSIS OF VARIANCE FOR 
TuresHotp Data 











Source df —. F P 
Periods (P) 5 4.10 | 8.72 | .0O1 
Days (D) 1 13.54 | 7.78 | .05 
Ss 19 7.39 | 4.25 01 
PxXD 5 43) — 

P X Ss 95 39; — 
D X Ss 19 1.74 | 3.70 001 
PXD~xX Ss 95 47; — 

















half hour vigil: (a4) a measure of the mean 
brightness increment required to evoke a 
response for each 15-min. period, i.e., the mean 
difference limen,? and (5) a tally of errors of 
commission, for each 15-min. period, ie., a 
record of responses made when no brighter 
stimulus was presented. 


RESULTS 


Change in threshold—The change 
with time in the difference limen is 
shown in Fig. 1 for each day. The 
threshold increase was such that in 
order for a brighter stimulus to be 
detected, it had to be about 25% 
more intense late in the session than 
at the beginning. On the basis of the 
analysis summarized in Table 1, it is 
clear that the variance contributed by 
15-min. periods is highly significant. 

Difference between days.—The vari- 
ance contributed by days, when tested 
against the significant day-by-subject 
interaction, is significant at the 5% 
level. As Fig. 1 shows, performance 
during each 15-min. period was better 
on Day 2 than on Day 1. Clearly 
there was a practice effect which im- 
proved the general level of perform- 
ance on Day 2, but this effect does 
not seem to have significantly altered 
the decremental trend which is found 
for both days. 


3 In four of the 1,440 trials given, no response 
was evoked for the largest brightness increment 
(Level 9). These trials were scored as if a 
response had been evoked at Level 9. 
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DISCRIMINATION DECREMENT 


Individual differences—The vari- 
ance contributed by individual Ss was 
significant at the 1% level, indicating 
that in terms of over-all threshold 
scores some Ss were good at the task 
and others poor. Further, the S-by- 
day interaction was significant, in- 
dicating individual differences in the 
extent of change in performance from 
Day 1 to Day 2. 

An analysis of the difference be- 
tween “good” and “poor” Ss was 
carried out. Good Ss may differ from 
poor Ss in either or botn of two ways: 
they may differ because of differences 
in threshold for the required discrimi- 
nation, or they may differ because of 
differences in the course of threshold 
change with time. In order to deter- 
mine whether differences in the course 
of threshold change with time are 
related to differences in over-all 
ability, Ss were ranked according to 
ability on the basis of the mean thresh- 
old over all 15-min. periods on both 
days, and divided into two equal 
groups, good and poor. The effect of 
time on threshold is shown for each 
group in Fig. 2. The ability-by- 
period interaction was significant at 
the 5% level. This means that as- 
sociated with the difference in over-all 
ability is a difference in the course of 
threshold change with time. As can 
be seen from Fig. 2 the good Ss show, 
on the average, a partial recovery from 
decrement after about 45 min. at the 
task (difference between Period 3 and 
Period 5 significant at 5% level), 
whereas the poor Ss show, on the 


average, a continuing increase in 
threshold in the course of the test 
session. Mackworth (4) also found 


that the slope of decline of perform- 
ance with time was less marked with 
good Ss than with poor Ss. 

Errors of commission.—For each 
day there was a trend toward a 
decreasing number of commission 
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Fic. 2. Difference limen as a function of 
time for “good” and “poor” Ss classified on 
basis of over-all ability. First and second days 
have been combined. 


errors from the beginning to the end 
of the session. Also, there were fewer 
commission errors on Day 2 than on 
Day 1; the difference between days 
was significant at the 5% level. On 
Day 1 the mean number of commis- 
sion errors decreased from 13 in the 
first period to 6in the last period. On 
Day 2 there was a decrease from a 
mean of 8 in the first-period to 3 in 
the last period. Although 15 of the 
20 Ss made fewer commission errors 
on Day 2, the extent of the day-to-day 
change was markedly influenced by 
four Ss with large differences in com- 
mission error frequencies between 
days. 


Discussion 


The results show that a decrement can 
be observed by taking repeated threshold 
measures during a prolonged discrimi- 
nation task. Thinking of the decrement 
as a threshold change not only provides 
a setting for all decrement data but also 
leads to the assertion, supported in the 
literature, that if the discrimination is 
made sufficiently easy, then there should 
be little or no decrement in frequency of 
response in the course of time. 

A possible explanation of the increase 
in threshold with time is suggested by 
some observations of Pavlov (5, p. 307) 
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who noted a tendency on the part of 
dogs to fall asleep while in the condition- 
ing stand. Kleitman (2, p. 286) has also 
presented evidence which shows that 
monotony of surroundings is a means of 
producing drowsiness or sleep. 

In view of the monotony of the experi- 
mental situation of the present study, 
it seems plausible to assume that this is 
the type of situation which would tend 
to induce a state of drowsiness. This 
assumption seems justified in view of the 
reports of Ss in which monotony and 
sleep symptoms were frequently men- 
tioned. Of interest in this connection is 
the fact that it was possible to eliminate 
decrement in the Clock Test by having 
Ss swallow a Benzedrine tablet before 
the test session (3, 4). In the case of 
motivated human Ss, set to continue 
working at the task, the tendency to 
drowsiness and sleep may be fought by 
an attempt on the part of S to change the 
situation psychologically in order to 
to reduce the monotony. This may be 
done by means which reduce attentive- 
ness to the discrimination task (e.g., 
daydreaming) and thus impair discrimi- 
nation performance. Reports from Ss 
on their activities during the test session 
support this explanation. Broadbent 
(1) has recently formulated a similar 
analysis of watchkeeping tasks in terms 
of attention. 


SUMMARY 


This study utilizes a method of repeated 
threshold measurement during a prolonged vigil 
in order to measure vigilance decrement. 


Twenty Ss were tested individually for 1.5 
hr. on each of two days. The S viewed a test 
area which flashed regularly once per second, 
to a standard (3.1 ft.-L.) brightness level for 
% sec. Test stimuli brighter than standard 
could be substituted for the standard by E at 
predetermined times. ‘Test stimuli ranged from 
3.2 ft.-L. to 4.0 ft.-L. in a series of nine equal 
log steps. The S responded by pressing a key 
whenever he saw a brighter flash. Test stimuli 
were used to measure threshold by the method 
of limits. During a trial, test stimuli were 
presented at 5-sec. intervals in an ascending 
series until an overt response was evoked. 

In the course of the test session the difference 
between the standard and the variable had to be 
increased in order for responses to be elicited. 
The extent of this increase represents a measure 
of vigilance decrement: This decrement is 
interpreted in terms of response to irrelevant 
stimuli as a means of reducing monotony in a 
monotonous situation. 
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THE NECESSARY CONDITIONS FOR 
CUE-POSITION 


DAVID BIRCH AND VICTORIA VANDENBERG 
University of Michigan 


Theory building in discrimination 
learning has a history of conflicts and 
confusions concerning the most fruit- 
ful way to define the effective stimuli 
in a discrimination learning problem 
(4, 6, 7, 9, 12). The present study 
adopts the analysis of Spence, who 
describes the total stimulus situation 
in terms of absolute and patterned 
stimulus components, and attempts to 
clarify the conditions necessary for 
the occurrence of cue-position pattern- 
ing (e.g., Black—Left) in a discrimina- 
tion learning problem. In his dis- 
cussion of the patterning of stimulus 
components Spence states, “Discrim- 
ination will involve such patterning 
according to this conception only 
when no one of the cue members is 
systematically reinforced more than 
the others” (10, p. 90). While the 
conditions stated by Spence may be 
very conducive for the occurrence of 
patterning, it may be that they are 
not necessary. 

The design of this investigation 
presented the animal with a discrimi- 
nation problem which could be learned 
on the basis of absolute components 
alone, but also allowed a test of the 
possible occurrence of stimulus pat- 
terning. The finding of stimulus 
patterning in this situation would 
indicate that Spence’s conditions are 
not necessary ones. 


MetTHop 


All Ss were first required to learn a discrimi- 
nation problem which paired Black on the Left 


with Gray on the Right (B G) and White on 

the Right with Gray on the Left (G W). In 

Table 1 this problem is shown as (B G) and 
G W) 
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designated Task 1. An analysis of the absolute 
stimulus components in the fashion of Spence 
shows Sp+, Sw+, Sc—, Sr+, and S_p+, and 
solution of the problem occurs when the com- 
bination of the approaching tendencies to Sp 
and Sz exceeds the combination to Sg and Sr, 
and when the combination to Sw and Spr exceeds 
that to Sg and S,. Despite the fact that com- 
binations of approaching and nonapproaching 
tendencies to these absolute components in the 
situation may account for the learning of the 
discrimination, it is possible that approaching 
tendencies to the patterned stimuli Spi and Swr 
and nonapproaching tendencies to Sg, and Ser 
could have developed. 

A series of transfer tasks using four groups of 
Ss were run to test for the presence of differential 
approaching tendencies to the absolute and the 
patterned stimuli. Table 1 summarizes the 
problems presented to Ss and provides analyses 
of the stimulus components for the learning 
tasks with and without patterning. 

The performance of Group A on Task 2 was 
contrasted with that of Group B on Task 3. If 
learning to the patterned stimuli has occurred in 
the original training, Group A is expected to 
demonstrate significantly more correct choices 
than Group B on the initial segment of the 
transfer problem trials; if learning has been to 
absolutes alone, both groups should respond with 
approximately 50% correct. Learning to the 
absolute components during the original training 
should be manifested by significant differences 
in the initial number of correct responses be- 
tween Group C on Task 4 and Group D on 
Task 5, with Group C showing the much larger 
number of correct choices. 

Groups C and D were employed in a second 
test for learning with respect to patterned 
stimuli by training the two groups to criterion 
on Tasks 4 and 5 and then testing them on 
Task 6, which is the same as Task 2 for Group A. 
Group C should respond in the neighborhood of 
50% correct on Task 6 whether or not the 
learning in Tasks 1 and 4 involved patterning, 
whereas Group D should respond approximately 
50% correct if absolutes alone are involved but 
should be close to 100% correct choice if ap- 
proaching tendencies to Spt and Swr on Task 1 
and nonapproaching tendencies to Spr and Swi 
on Task 5 were acquired. 

Subjects—The Ss were 28 experimentally 
naive, male rats, 90 to 120 days old at the be- 
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TABLE 1 


Series oF LEARNING AND Test Tasks witH ANALYSES OF THE STIMULUS CoMPONENTS 
Wirs anp Wirnovut Patrerntnc ror THE LEARNING TASKS 











Analysi Analysis Includi 
Task Groups Pe my Se re.. 
(3 6G) 
B G Ss+, Sw, Se- Ss+, Sw+, Sa- 
1. A836 D Sr+, Spt Sr+, Sit 
= + S , Sw , x. nai 
(a ¢.) pL+, Swrt+, Sat—, Sar 
Learn 
(3 3B) 
B B 
ae A 
(w w) 
W W 
Test 
(3 B) 
B B 
3. B 
(w w) 
WwW W 
Test 
(cg B) 
G B Ss+, Sw+, Sa— Sp+, Swt+, Sa— 
4. Cc Sr+, Sit Sr+, Sit 
+ - S , Swit, Sar—, Scr—- 
(+ =) BpR+, Swit, Sat—, Scr 
Learn 
(6 3B) 
- ee Ss—, Sw—, Sct Sp—, Sw—, Sot+ 
5. D Sr+, Spt Sr+, Sut 
- +) Spr—, Swi—, Sat+, Se 
( 5s ) Sat+, Sacr+ 
Learn 
(3 B) 
B B 
6. c,D 
(w w) 
W WwW 
Test 














ginning of the experiment. Fourteen animals 
were of a pigmented strain derived originally 
from the Lashley stock and maintained by the 
Psychology Department at the University of 
Michigan, and 14 were hooded rats of the Long- 
Evans strain obtained from the Lurie supply in 
Chicago. Groups A and C each contained four 
Ss from the departmental stock and three from 
the Long-Evans strain; Groups B and D each 
contained three Ss from the departmental stock 
and four from the Long-Evans strain. No 
difference in performance attributable to strain 
was observed. Four additional Ss, two from 
each strain, were discarded because of stereo- 
typed position habits on the original or transfer 
tasks. 

Apparatus—The apparatus was an open 
elevated Y maze, 2 ft. high, placed upon a table 
4x5 ft. The table was 2} ft. from. the floor 
and located in the middle of a room 9} ft. X 9} 


ft. The experiment was conducted at night 
and illumination was provided by diffuse lighting 
from fluorescent lamps recessed in the ceiling. 
The runway was 32 in. long and 2 in. wide and 
led immediately to an enlarged choice platform 
about 8 in. in diameter. Removable stimulus 
stands of the same height as the runway and 
choice platform formed the arms of the Y. 
These stands were 10 in. long and 4 in. wide and 
were separated from the choice platform by a 
5-in. gap. The distance between the stands at 
the narrow portion of the arms was 7} in. and 
19 in. at the wide portion. The entire stimulus 
stand was painted either black, white, or gray, 
and throughout the experiment two stands of 
each brightness were used interchangeably. 
The runway and choice platform were painted 
the same gray as the gray stimulus stands. A 
small removable metal food cup could be at- 
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(Tose 1) (Tests 4 ond 5) (Tests 2, 5, ona 6) 
Fic. 1. Percentage of correct choices during the original learning, 


the transfer learning, and the test tasks. 


tached at the far end about 1 in. below the 
level of each stimulus stand. 
Adaptation training.—The Ss were placed on 
a 24-hr. feeding schedule for about 2 weeks 
prior to the experiment. During this time they 
were handled extensively and trained to walk 
along a narrow board placed between two chairs, 
and to cross a gap up to 5 in. from one chair to 
the other to obtain food. The food consisted of 
Miller’s Homogenized Dog Food pellets and the 
daily ration for all Ss varied from 8 to 12 gm. 
* depending on the size and need of the particular 
animal. The £ attempted to estimate such 
needs during the adaptation period and felt this 
variation to be more effective in controlling 
excessive loss of weight and overactivity than a 
standard amount for all Ss. Once determined, 
the amount of food given to any S was kept 
constant throughout the experiment. 
Pretraining on the apparatus—After the 
adaptation period all Ss were given at least two 
days’ experience, 5 trials each day, on the ap- 
paratus. The two stimulus situations in the 
original learning task, B-G and G-W, were 
presented an equal number of times and in 
random order. During this period, food cups 
with one pellet of food (approximately .07 gm.) 
in each cup were attached to both stimulus 
stands. Most Ss ran down the runway, ap- 
proached one of the stands, ate the pellet, 
crossed back to the choice platform and then 
to the other stand and ate the pellet there. No 
guidance was given to Ss who approached both 
stimulus stands readily. In this way E could 


determine position preferences from the number 
of times S selected either the right or the left as 
his first choice. The Ss who made but one 
choice were guided to the other side. Those 
who were more tense and hesitated in making 
any choice were guided alternately to both sides. 
These Ss were given a few more pretraining 
trials on the apparatus until they reacted without 
any hesitation to both stimulus stands. This 
behavior was used by E£ as a criterion of “readi- 
ness.” Analysis of the data showed that those 
Ss (N = 8) who received more than 10 trials of 
pretraining on the apparatus did not differ from 
the others. 

Discrimination training.—On the day follow- 
ing pretraining, Ss were presented with the 
original learning task (Task 1). Ten trials per 
day, 5 to each of the stimulus pairs, were given 
by a series of Gellerman orders (3). If S ap- 
proached the positive stand, he was allowed to 
eat one small pellet of food placed in the food cup. 
Immediately thereafter he was picked up and 
placed on a chair in the experimental room. If 
he approached the negative stand, he was 
picked up after he found neither food cup nor 
food at the end of the platform. No punish- 
ment was used, and no correction was permitted.! 
Usually four Ss were run at a time, all Ss com- 


1 One S in Group A who developed a position 
habit in the original learning task was allowed to 
explore both stimulus stands in both stimulus 
situations on 5 trials of the 15th day; he reached 
criterion within the following three days. 
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pleting one trial before the beginning of the next 
trial. If conditions necessitated the use of 
fewer than four Ss, the time between trials was 
kept comparable to that when four Ss were used. 
Criterion of learning was 20 successive correct 
trials on two successive errorless days. 

At the beginning of the discrimination train- 
ing, Ss were randomly assigned to one of the 
four transfer groups. Those who reached cri- 
terion in the original task were preseated the 
transfer task the following day. Criterion of 
learning for Groups C and D in transfer learning 
Tasks 4 and 5 was also two successive perfect 
days. Groups C and D were then placed in 
transfer Task 6. All tasks were run with one 
alternative rewarded and the other nonrewarded 
according to the schedule of Table 1. 


RESULTS 


The performance of each of the 
four groups in percentage of correct 
choices for the original learning and 
each of the transfer tasks is shown in 
Fig. 1. Each point is based upon 
10 trials, 5 involving black and 5 
involving white, averaged over seven 
Ss. The plots indicate no appreciable 
differences between the groups on the 
original learning but sizable differ- 
ences between Groups C and D on the 
transfer learning and between all 
groups on thetest trials. The slightly 
extended learning period for Group A 
on Task 1 is due to the position habit 
responses of one animal over Days 
7-14. 

Table 2 presents the mean number 
of errors per S per block of 10 triais 


for all four groups on the 20 test 
trials, and for Groups C and D on the 
first 20 trials of the transfer learning 
task, and summarizes the statistical 
evaluation of the differences between 
Groups A and B and between Groups 
C and D. The Mann-Whitney U 
test (8) was employed for all tests of 
significance. 

The test for learning to absolute 
components, the transfer learning 
task, showed that Group C made 
significantly fewer errors than Group 
D with the greater difference found on 
the G-B stimulus combination. The 
20 test trials used to test for learning 
to patterned components produced 
significantly fewer errors in Group A 
than in Group B and in Group D than 
in Group C. 

In fact, except for the Group B- 
Group C difference, which reaches 
only the .15 level of significance, 
analysis of the total error score on the 
two test trial days shows all differ- 
ences between pairs of groups to be 
significant beyond the .05 level. 
Thus, the ranking of the groups from 
least to most errors on the test trials 
may be taken with some confidence 
to be DACB. Examination of Table 
2 reveals that in all but one com- 
parison (A-D), the groups differed 
more on the W-W stimulus combina- 
tion than on the B-B, but in every 


TABLE 2 


Mean Numser or Errors PER S per Buiock or 10 Trias on Tii—E 20 Test TRIALS 
AND THE First 20 Triats or THE TRANSFER LEARNING Task ALONG 
witH SIGNIFICANCE LEVELS* For DiFFERENCES BETWEEN Groups 

















Test Trials | Test Trials Transfer Learning 
Group B-B | Ww-W | Total Group B-B | Ww-W | Total G-B | W-G | Total 
A 3.4 2.7 3.1 / 4.6 | 5.9 5.2 0.1 2.6 1.4 
B 4.4 8.4 6.4 D 0.1 1.0 0.6 8.9 4.4 6.6 
? 0.58 0.002 0.002 ? 0.006 | 0.002 0.000 | 0.000 0.42 0.002 








* The values for the significance level3 were obtained from Mann-Whitney U tests which were run on the 
number of errors during the 20 test trials and the first 20 trials for the transfer learning task. All probabilities 
are for two-tailed,tests. 
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comparison except B-C the direction 
of the difference between groups was 
the same in both stimulus com- 
binations. 

Differences between pairs of the 
four groups in the total number of 
errors made in reaching the criterion 
of two successive errorless days on the 
original task failed to reach the .10 
level of significance, either on the 
B-G or G-W stimulus combinations 
separately or on the two combjned. 


Discussion 


The results of the comparisons be- 
tween Groups A, B, C, and D on the 
test trials strongly suggest that response 
tendencies to cue-position patterns can 
be acquired by rats in a situation where 
the discrimination can be learned on the 
basis of absolute components alone. 
While this is contrary to Spence’s ex- 
pectation, to the extent that his state- 
ments concerning patterning are inter- 
preted as reflecting necessary conditions, 
it should be noted that the results fit 
nicely with the theory if patterning is 
introduced into the analysis. Thus, 
Group D would be expected to perform 
very well on the test problem because 
Spt and Swr had been consistently rein- 
forced on Task 1, and Spr and Swy con- 
sistently nonreinforced on Task 5 during 
training. Group A with Spr and Swr re- 
inforced during training on Task 1 should 
also show an initial advantage on the test 
but to a lesser degree than Group D 
because Spr and Swi were never pre- 
sented to Group A during training. 
Group C with all four patterns reinforced 
during training on Task 1 should perform 
in the neighborhood of 50% correct 
while the transfer of approaching re- 
sponse tendencies to the stimulus pat- 
terns, Spi and Swr, by Group B should 
yield less than 50% correct choices. 
The obtained percentages of 94, 69, 48, 
and 36 for the four groups indicate 
marked agreement with these expecta- 
tions. At the same time the perform- 
ance of Groups C and D on the transfer 
learning task reflects the operation of the 
response tendencies to the absolute 


components. 
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A further implication of the present 
study concerns the explanation of the 
faster learning of the so-called simul- 
taneous problem over the successive 
problem where S approaches one of the 
discriminanda (10, 12). In a Black- 
White simultaneous problem an analysis 
including patterned stimuli shows the 
differential approaching and not ap- 
proaching tendencies to Sg and Sw to be 
facilitated by differential approaching 
and not approaching tendencies to pat- 
terns of Sgr, Spr, Swr, and Swi. In 
contrast, the successive problem provides 
only patterned cues as the basis of the 
discrimination. It seems possible that 
the faster learning in the simultaneous 
problem may be attributable, at least in 
part, to the larger number of cues avail- 
able for discrimination in this situation. 

The results also raise the question of 
possible other patterning of stimuli (e.g., 
transverse patterning) in simple dis- 
crimination learning problems. Extra- 
polation of the present evidence would 
tend to indicate that the performance of 
animals in a discrimination task is 
determined not by approaching and not 
approaching tendencies either to absolute 
or to patterned stimuli alone but rather 
by a function of weighted tendencies to a 
variety of stimulus elements, both abso- 
lute and patterned. The relative weights 
of the tendencies may well be expected 
to change as a function of changes in the 
stimulus situation. 

The adequacy of accounting for the 
data in terms of responses to whole con- 
figurations or to relations between stimuli 
may be examined. One suitable possi- 
bility for the original learning might be: 
“If the whole is darker, choose the left; 
if the whole is lighter, choose the right.” 
This is comparable to Bitterman’s “‘con- 
figurational” patterning (12) (after the 
analysis of Gulliksen and Wolfle [4)), 
where within-pairs differentiation has 
not developed and Ss respond to the two 
stimuli as a “‘whole.”” While the transfer 
results of Groups A and B could be 
accounted for in terms of such an anal- 
ysis, this is not true of the performance 
of Groups C and D on the transfer 
learning tasks where the brightness of 
the “wholes” was not altered but the 
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positions of the components were inter- 
changed. 

Analysis in terms of within-pairs 
differentiation (Bitterman’s second level 
of relational learning) encounters diffi- 
culties in explaining the learning of the 
original task. If learning occurred solely 
on the basis of within-pairs differentia- 
tion, one would probably predict that the 
problem should be more difficult than 
actually found in this study, and also 
that many Ss would fail to meet criterion. 
Consistent responding to either the 
darker or the lighter of the two com- 
ponents would appear as position re- 
sponses to one or the other side and 
would be rewarded only one-half of the 
time. Even if Ss did finally learn the 
original task on some sort of relational 
basis, as approach to the extremes of a 
brightness gradient, such learning would 
be of no aid in Tasks 2, 3, and 6 where 
both stimuli presented were either black 
or white. 

Conceivably, the results of this ex- 
periment could be accounted for by 
assuming that both types of differentia- 
tion, between-pairs and within-pairs, 
were developed in the original learning 
task and that appropriate selection 
occurred in the transfer tasks. From 
Bitterman’s emphasis (11, 12) on the 
“qualitative distinctiveness” of the two 
levels of organization and from _ his 
labeling of one (configurational) as more 
“primitive” than the other (relational), 
one is led to believe that Bitterman 
restricts each to a particular kind of 
problem, the successive and simultane- 
ous, respectively. In more recent 
articles, however, Bitterman (1, 2) ap- 
pears to emphasize these restrictions less 
and to permit the possibility of the 
development of different levels of or- 
ganization in one and the same problem. 
The latter position would be more 
similar to the interpretation stated previ- 
ously regarding the weighted tendencies 
developed in a discrimination problem. 


SUMMARY 


Four groups of seven rats each were trained 
on a brightness discrimination proklem which 
could be learned on the basis of absolute com- 
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ponents alone, of patterned stimuli alone, or of 
both. A transfer learning task presented to two 
of the groups demonstrates the approaching and 
not approaching tendencies acquired to absolute 
components while a test task given to all four 
groups indicates the presence of response tend- 
encies to patterned stimuli. The latter result 
is interpreted to be in disagreement with the 
conditions required for patterning as stated by 
Spence. 

Analysis of the problem including response 
tendencies to both absolute and patterned 
stimuli as discrete stimulus elements is seen to 
account for the results while no simple con- 
figurational or relational analysis appears 
adequate to encompass all the data. 
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